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i Introduction 



1.1 Goals in this course 

These are my hopes for the course. Let me know if you feel the course 
is not fulfilling them for you. Be free with your criticism & comments! 
Thanks. 

• Teach basics of collective phenomena in electron systems 

• Make frequent reference to real experiment and data 

• Use 2nd quantized notation without field-theoretical techniques 

• Get all students reading basic CM journals 

• Allow students to practice presenting a talk 

• Allow students to bootstrap own research if possible 

1.2 Statistical mechanics of free Fermi gas 

1.2.1 T = Fermi sea 

Start with simple model of electrons in metal, neglecting e~ — e~ interac- 
tions. Hamiltonian is 

/i 2 V 2 

H = -Y, L : j = 1, ... TV particles (1) 

j ZTfl 

Eigenstates of each — (7i 2 V 2 /2m) are just plane waves e lk r labelled by k, 
with ki = 27Trii/Li in box with periodic B.C. Recall electrons are fermions, 
which means we can only put one in each single-particle state. Including 
spin we can put two particles (||) in each k-state. At zero temperature 
the ground state of A^-electron system is then formed by adding particles 
until we run out of electrons. Energy is = h 2 k 2 /2m, so start with 
two in lowest state k = 0, then add two to next states, with k x or k y 



2 



or k z = 27r/L, etc. as shown. Energy of highest particle called "Fermi 
energy" £p, magnitude of corresponding wave vector called hp- Typical 
Fermi energy for metal ep — leV — 10 A K. At T = only states with 
k < kp occupied (Fermi "sea" or Fermi sphere), so we can write density 
of electrons as 2*# occupied states/ Volume (2 is for spin): 

2 k Jl „ r d 3 k 1 rkj? , o ,, k 3 



n " L 3 £ ~ 2 ^ (27r)3 " tt2 /o ^ - 3^2 



SO 



/c F = (37r 2 n) 1/3 01 £ F = 



/i 2 (37r 2 n) 2 / 3 



2m 



(2) 



(3) 



in other words, nothing but the density of electrons controls the Fermi 
energy. 




8 r 



\ 1 — I — I— ► 



Figure 1: States of Fermi gas with parabolic spectrum, e = k 2 /2m. 



The total ground state energy of the Fermi gas must be of order £p, 
since there is no other energy in the problem. If we simply add up the 
energies of all particles in states up to Fermi level get 



E 
L3 



1 rk 



7T 



2 JO 



dk t 



(h 2 k T 



h 2 k F 



2m 



IOtt 2 



m 



(4) 



and the ground state energy per particle (N = nL 3 is the total number) 
is 

(5) 



= -£p 

N 5 
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1.2.2 T > Free energy. 

Reminder: partition function for free fermions in grnd conical ensemble is 
Z = Tre-K*-^ (6) 



(7) 
(8) 



= E (ni : n 2 ...n OQ \e ^ H fiN) \ni 1 n 2 ...n 00 ) 

ni,n 2 ...n koQ 

E (ni : n 2 ...n 00 \e~ p ^ i[£ini ~^\ni : n 2 ...n 00 ) 

ni,n 2 ...noo 

where i labels single-fermion state, e.g. i = k, a, and rij runs from to 
1 for fermions. Since many-fermion state in occ. no. representation is 
simple product: |ni, n 2 ...n 00 ) = |ni)|n2)...|rioo), can factorize: 



Z= |Ee 



-/3[eini-Ami] 



Ee 



-/5[£oonoo-/inoo] 



(9) 



oo 



SO 



Z = ng (l + e-Pte-ri) 



(10) 



Since the free energy (grand canonical potential) is Q = —ksT logZ, we 
get 



Q = -jfeflTEg! log (l + e-Pte-ri) 



11) 



1.2.3 Avg. fermion number. 

We may want to take statistical averages of quantum operators, for which 
we need the statistical operator p = Z~ l e~^ H ~^ N \ Then any operator 
O has an expectation value (O) = Tr(pO). For example, avg. no. of 
particles 



(N) = Tr(pN] 

Tr( e -/?(^-^)) 



(12) 
(13) 
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Now note this is related to derivative of Q wrt chem. potential ji\ 



dtt 



and using Eq. 11, we see 



-k B T 

-1 



d\ogZ -k B TdZ 



Z 



dfj, Z 
Ti(pN) = -(N) 



oo 



1 



oo 



(14) 
(15) 

(16) 



where Uq is the avg. number of fermions in a single-particle state i in 
equilibrium at temperature T. If we recall i was a shorthand for k, cr, but 
£k doesn't depend on cr, we get Fermi- Dirac distribution function 



n 



k(7 



1 

I _|_ e /3(£k-A*) 



(17) 



1.2.4 Fermi gas at low T. 

Since the Fermi energy of metals is so high (~ 10 4 i^), it's important to 
understand the limit k B T C where the Fermi gas is nearly degen- 
erate, and make sure the classical limit k B T ^> ep comes out right too. 
Let's calculate, for example, the entropy and specific heat, which can be 
obtained from the thermodynamic potential Q via the general thermody- 
namic relations 

dQ\ „ m (dS" 



S = - 



From (11) and (16), and including spin, we have 

Q = -2k B T^og(l + e-^-^) 
k 

= 2k B TL 3 J deN(e) log (l + e^ - ^) 



= 2 kjfL« N ®{-w)r (19) 



where I introduced the density of Instates for one spin N(e) = £~ 3 ^ k £(e- 
Ek). The Fermi function is f{e) = 1/(1 + exp P(e — fi)), & I defined shifted 
energy variable £ = e — fj,. In general, the degenerate limit is characterized 
by k sums which decay rapidly for energies far from the Fermi surface, so 
the game is to assume the density of states varies slowly on a scale of the 
thermal energy, and replace N(e) by N(ef) = Nq. This type of Sommer- 
feld expansion 1 assumes the density of states is a smoothly varying fctn., 
i.e. the thermodynamic limit V — » oo has been taken (otherwise N(e) is 
too spiky!). For a parabolic band, = h 2 k 2 /(2m) in 3D, the delta-fctn. 
can be evaluated to find 2 



(22) 

This can be expanded around the Fermi level: 3 

N(0 = N(0) + N'(0)( + \N"(0)e + ... (24) 
(In a horrible misuse of notation, N(0), N(ep), and Nq all mean the 

1 If you are integrating a smooth function of e multiplied by the Fermi function derivative —df/de, the derivative 
restricts the range of integration to a region of width k B T around the Fermi surface. If you are integrating something 
times /(e) itself, it's convenient to do an integration by parts. The result is (see e.g. Ashcroft & Mermin appendix 
C) 




f 

J — I 



/V 00 d 2n ~ l 

dsH(e) + J2^(k B T) 2n -—^H(e)\ £ ^ (20) 
-°° „=i a£ 



where a n = (2 - l/2 2(n -^) C(2n) (C is Riemann ( fctn., C(2) = tt 2 /6, C(4) = tt 4 /90, etc.). 
2 Here's one way to get this: 



at/ x t-1 r, \ f d 3 k de \/2me. f k 2 dk ( m \ 



d 3 fc , de ,,.„ \j2me. f k 2 dk ( m \ ., \J1me. 3 n / e V' 2 /„,n 
— * - = — ' - ^ ^ - ~ ' 2-(-j (21) 



3 When does the validity of the expansion break down? When the approximation that the density of states is a 
smooth function does, i.e. when the thermal energy fcgT is comparable to the splitting between states at the Fermi 
level, 

h 2 k F Sk 5k a 

0£k Up — ~e F — ~£p-, (23) 

m k F L 

where a is the lattice spacing and L is the box size. At T — IK, requiring k B T <~ fc, and taking e F /k B — 10 4 K 
says that systems (boxes) of size less than \\ivn will "show mesoscopic" effects, i.e. results from Sommerfeld-type 
expansions arc no longer valid. 
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Figure 2: Density of states for parabolic spectrum, e = k 2 /2m 

density of states at the Fermi level). The leading order term in the low-T 
specific heat is therefore found directly by scaling out the factors of T in 
Eq. (19): 

cv ^ 2k B iiV (=|) e = 2k B T JZdx(^y (25) 

7T 2 /3 

So 

2tt 2 

c v ~—N k B T + O(T 3 ). (26) 
This is the famous linear in temperature specific heat of a free Fermi 

4 

gas. 



4 Note in (26), I extended the lower limit — /i of the integral in Eq. (19) to — oo since it can be shown that the 
chemical potential is very close to ep at low T. Since we are interested in temperatures fc^T <C Sf, and the range 
in the integral is only several ksT at most, this introduces neglible error. 

Why: 

At T—0, the Fermi function n£ — > step function 6(jj, — £k) 5 so we know /j,(T = 0) must just be the Fermi energy 
e F = h 2 (3n 2 n)y 3 /2m. 

k 



2 J deN{e)f{e) 

/ deN(e) + —(kBT) 2 N'(e)\ £=tl (continued on next page) 
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1.2.5 Classical limit. 



I won't calculate the classical limit. All the standard results for a Boltzman 
statistics gas, e.g. cy{T ^> sf) = (3/2)NkB follow immediately from 
noticing that the Fermi function reduces to the Boltzman distribution, 



(You will need to convince yourself that the classical result ^/(ksT) —> 
— oo is recovered to make this argument.) 

1.3 Second quantization 

The idea behind the term "second quantization" arises from the fact that in 
the early days of quantum mechanics, forces between particles were treated 
classically. Momentum, position and other observables were represented 
by operators which do not in general commute with each other. Particle 
number is assumed to be quantized as one of the tenets of the theory, e.g. 
Einstein's early work on blackbody radiation. 

At some point it was also realized that forces between particles are also 
quantized because they are mediatedby the exchange of other particles. In 
Schrodinger's treatment of the H-atom the force is just the classical static 
Coulomb force, but a more complete treatment includes the interaction 
of the H-atom and its constituents with the radiation field, which must 
itself be quantized ("photons"). This quantization of the fields mediat- 
ing the interactions between matter particles was referred to as "second" 
quantization. In the meantime, a second-quantized description has been 
developed in which both "matter fields" and "force fields" are described 



f(e) ^ e-^ £ -»\ T 



oo. 



(27) 




n 



~e F -—{k B T) 



2 N'(e F ) 
N(e F ) 



Since N' 



/N is typically of order 1/ep, corrections are small. 



8 



by second-quantized field operators. In fact, modern condensed matter 
physics usually does go backwards and describe particles interacting via 
classical Coulomb forces again, 5 but these particles are described by field 
operators. Once the calculational rules are absorbed, calculating with 
the 2nd-quantized formalism is easier for most people than doing 1st- 
quantized calculations. They must, of course, yield the same answer, as 
they are rigorously equivalent. I will not prove this equivalence in class, 
as it is exceedingly tedious, but merely motivate it for you below. I urge 
you to read the proof once for your own edification, however. 6 

1.3.1 Symmetry of many-particle wavefunctions 

Quantum mechanics allows for the possibility of indistinguishable parti- 
cles, and nature seems to have taken advantage of this as a way to con- 
struct things. No electron can be distinguished from another electron, 
except by saying where it is, what quantum state it is in, etc. Inter- 
nal quantum-mechanical consistency requires that when we write down 
a many-identical-particle state, we make that state noncommittal as to 
which particle is in which single-particle state. For example, we say that 
we have electron 1 and electron 2, and we put them in states a and b re- 
spectively, but exchange symmetry requires (since electrons are fermions) 
that a satisfactory wave-function has the form 

*( ri) r 2 ) = A[(^ a (r 2 )^(n) - V a (ri)^(r 2 )]. (28) 

If we have N particles, the wavefunctions must be either symmetric or 
antisymmetric under exchange: 7 

^ s (ri . . . r 2 . . .Yj . . .y n ) = ^ B (ri . . . r 7 . . . r, . . .y n ) Bosons (29) 
\I/ F (ri . . . Yi . . . Yj . . . yn) = — \I/ F (ri . . . Yj . . . Yi . . . yn) Fermions (30) 

5 Q: when does this approximation break down? 

6 See, e.g. Fetter & Wallecka, Quantum Theory of Many-Particle Systems 

7 Rccently, a generalization of Bose & Fermi statistics to particles called anyons has been intensely discussed. 
Under exchange an anyon wavefunction behaves as ^(ri . . . . . . . . . rjv) = e t9l i' A (ri . . . rj . . . . . . rjv) for some 

< 9 < 2tt. 



9 



Given a set of single-particle wave functions <fe z (r), where Ei is a quan- 
tum number, e.g. energy, we can easily construct wave fctns. which satisfy 
statistics, e.g. 



B 

\jy f 



m...n 00 



(r b ...rjv) = 



'n\\n2\ 



E (±1) SgnV V nf =1 ^(rO (31) 



^€{^1,^2...^} 



Remarks on Eq. (31): 

• sum all permutations of the ^ 'sin product (^^(ri) (r 2 ) . . . ())e n (yn) 8 

• # distinct i^'s occuring may be less than N, some missing because 
of multiple occupation in boson case. Example: 

particles 



20100. ..0 

(ri,r 2 ,r 3 ) = 



^{<M ( r l)0Si( r 2)0S 3 ( r 3) + 
+0£ 3 ( r l)0£i( r 2)0£i(r3) + 

+0Si(ri)0E 3 ( r 2)0Ei(r3)} 

■0 ~~ 

Fig. 2. Possible state of 3 noninteracting Bose 

• Completely antisymmetric Fermionic wavefunction called Slater de- 
terminant: 



ni...7ioo 



(r b ...riv) = 



1 \ x / 2 



.AT! 



(32) 



where there are N eigenvalues which occur between E m i n and E. 
inclusive, corresponding to N occupied states. 



max-i 



8 You might think the physical thing to do would be to sum all permutations of the particle labels. This is correct, 
but actually makes things harder since one can double count if particles are degenerate (see example of 3 bosons 
below.) The normalization factor is chosen with the sum over all permutation of the E^s in mind. 
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1.3.2 Field operators 

2nd quantization is alternative way of describing many body states. We 
describe a particle by a field operator 

W^a^W (33) 

i 

where % runs over the quantum numbers associated with the set of eigen- 
states </>, di is a "coefficient" which is an operator (I'm going to neglect 
the hats (" ) which normally denote an operator for the a's and a^'s), and 
<f>E i is a "lst-quantized" wavefunction, i.e. a normal Schrodinger wave- 
function of the type we have used up to now, such that (for example) 
H<j)E i = EiCpE,- Now we impose commutation relations 

'^{r),^(r f )] ± = 5{r-r') (34) 
^(r),^)^ = ^( r )^t( r /)j^ = , (35) 



± 



which implies 

[a h a]]± = 8 tJ ; [a u a 3 } ± = [a], a]]± = 0. (36) 

The upper sign is for fermions and the lower for bosons in Eqs. (35) and 
(36). 

Now construct many-body states from vacuum (no particles) |0). a called 
annihilation operator, a) creation operator (see below). 

Examples & comments {all properties follow from commutation rela- 
tions): 

Bosons : 

• one particle in state i: aj|0) = 

• annihilate vacuum a^O) = 

• (Bosons) 9 ajaj|0) = |2), 

9 Analogous state for fermions is zero, by commutation relations-check! 
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• (Bosons) two in i and one in j: ajaja]|0) = |2)j|l)j 

• a\a,i = hi is number operator hi state i. 
Proof: (bosons) 

(a\ai)\n)i = (aja z )(aj) n |0) 

= 4(1 + aU^alr-'lO) = \n) l + (aj) 2 a t (al) n ^) 
= 2\n)i + (al)\(al) n - 2 \0) ... = n\n)i 



Similarly show (bosons) 



.10 



a\\n)i = (n+ l) 1/2 |n+ 1), 



cii\n)i = n l / 2 \n — 1), 



many-particle state 

1 \V2 



v ni!n2' 



(a t i) ni (4)' !2 '--|0> = |ni,n2...n O o> (37) 



oo- 



★ occupation numbers specify state completely, exchange symmetry 
included due to commutation relations! (normalization factor left for 
problem set) 



Fermions 

• Anticommutation relations complicate matters, in particular note 

a 2 = (a)) 2 = rii = (Pauli principle) (38) 

10 By now it should be clear that the algebra for the bosonic case is identical to the algebra of simple harmonic 
oscillator ladder operators. 
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so 



at 


0> = 


= 11} 


a|l) = 


10} 


a 


f ll> 


= 


a|0) = 


= 



(39) 



• many-particle state 

(oIr(4)" 2 • • • |0> = Kna-.-noo) 
* note normalization factor is 1 here. 

• action of creation & annilation operators (suppose n s = 1): 
o.| .../«„.. .) = (-l)»i-H»+-»-i(al)»i ■ ■ ■ a s at ■ ■ ■ (al) n °°\0) 



(40) 



= (_ 1 )ni+n 2 +...n s _ 1 ( a t)n, 
= (_^™l+ n 2+- n »-l 



.(l-aJa J )...(at o )"»|0> 

=0 

n s - 1 . . .) 



also 

a s |...0...) =0 (41) 

and similarly 

at|. ..„....)= (42) 



1.3.3 2nd- quantized Hamiltonian 

★ Point: Now "it can be shown" 11 that state vector 

|^K>) = E /(ni,n 2 ...n 00 ,t)|ni,n 2 ...n 00 ) (43) 

ni,n 2 ...noo 

11 Normally I hate skipping proofs. However, as mentioned above, this one is so tedious I just can't resist. The 
bravehearted can look, e.g. in chapter 1 of Fetter and Wallecka. 
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satisfies the Schrodinger equation (our old friend) 



dt 

if we take the " 2nd-quantized" form of H 



ih^-mt)) = H\V(t)) (44) 



H = J2al(i\T\j)a 3 + - £ ala](ij\V\k£)a £ a k (45 

ij ^ ijkl 

= J d 3 rtp\r)T(r, V r )^(r) (46) 
+-// d 3 r dh'^\Y)^\r')V{Yy)^{Y')^{Y) (47) 
where the 1st quantized Hamiltonian was H = T + V '. 
Translationally invariant system 

It may be more satisfying if we can at least verify that this formalism 
"works" for a special case, e.g. translationally invariant systems. For 
such a system the momentum k is a good quantum number, so choose 
single-particle plane wave states 

4>i(r) = <Mr) = £- 3/ V k ' r « ff , (48) 

where u a is a spinor like u± = (J), etc. 1st quantized T is — V 2 /(2m), 12 
so 2nd-quantized kinetic energy is 



f = Y,al(i\T\j)a 3 = £ (^— \ a ] ka a ka 

ij ka \^Tfl 



(49) 



Since we showed a^a^ is just number operator which counts # of par- 
ticles in state k<r, this clearly just adds up the kinetic energy of all the 
occupied states, as it should. For general two-particle interaction V, let's 
assume V(r, r') = V (r — r') only, as must be true if we have transl. 
invariance. In terms of the Fourier transform 

V(q) = 4 / dh e* qr V(y) (50) 
L 6 J 



12t; 



I'll set Ti — 1 from here on out, unless required for an honest physical calculation. 
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we have (steps left as exercise) 




1 



./1/(q)a k / a ak+ q( 7 



(51) 



V 



+qo" 



1.3.4 Schrodinger, Heisenberg, interaction representations 

Here we just give some definitions & reminders on different (equivalent) 
representations of quantum mechanics. In different reps., time depen- 
dence is associated with states (Schrodinger), operators (Heisenberg), or 
combination (interaction) . 

• Schrodinger picture 



state 




operators Os ^ Os(t) 
H\i>s(t)) 



U 

i—\i/j s (t)) 



has formal solution 




(52) 



Note H hermitian =^> time evolution operator \ U = e 



iH(t-t ) 



is uni- 




(where Hq usually soluble) 




e >H tQ -iH t 



=► *»il&(*)> 



H'(t)\Mt)) 
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with H'(t) = e l&ot H f e- lAot 



Remarks: 



— states and ops. t-dependent in interaction picture, but time de- 
pendence of operators very simple, e.g. 

t 

k 

i^-a kI (t) = e^[a k ,^o]e-^ ot 
at 



au(t) = a k e^ 
Time evolution operator determines state at time t: 

Mt)) = u(t,to)\Mto)) 

From Schrodinger picture we find 



U(t,t Q ) = e iHot e- iH ^- to) e- iHoto 



(Note ([H,H Q ] 7^0!) 

Heisenberg picture 

state \iPh) ^-independent, 
operators 6 H (t) = e im 6 s e~ iHt 

so operators evolve according to Heisenberg eqn. of motion 




-k Note-compare three reps, at t = 0: 

\4> H ) = \f s (o)) = \Mo)) 
6s =o H (Q) =6/(0) 



(53) 



(54) 



(55) 
(56) 
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1.4 Phonons 

1.4.1 Review of simple harmonic oscillator quantization 

I will simply write down some results for the standard SHO quantization 
from elementary QM using "ladder operators" . We consider the Hamilto- 
nian 

H = Kj + ~<1 2 (57) 
2M 2 H K J 

and extract the relevant dimensions by putting 

2 K 

UJ = 



i = q 



M 



8 

-i— = p(hMu)- l/2 (58) 



so 



H = 



( d 



2 



2 



d ^ey (59) 

We recall soln. goes like e~^/ 2 i/ n (£), where H n are Hermite polynomials, 
and that eigenvalues are 

E n = hu{n + 1/2) (60) 
Define ladder operators a, a) as 

1 /. d 



a = 



A* + ir) (61) 



Ladder ops. obey commutation relations (check) 

[a, a*] = 1 ; [a, a] = ; [a^,a^]=0 (63) 
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X 



eq 



x 



eq 
1+1 



K K K 



i+1 



eq 



Figure 3: Linear chain with spring couplings K. Dynamical variables 

& then H may be written (check) 

H = huo [a ] a + -j . (64) 
a, a) connect eigenstates of different quantum nos. n, as 

W = (^i72l°)' ( 65 ) 

where |0) is state which obeys a\0) = 0. Operating on \n) with may 
be shown with use of commutation relations to give 

o)\n) = (n+ l/2) 1/2 |n+ 1> ; a|n) = n 1/2 |n - 1) (66) 

so with these clefs, the ladder operators for SHO are seen to be identical 

to boson creation and annihilation operators defined above in Sec. 1.3.2. 



1.4.2 ID harmonic chain 

If we now consider N atoms on a linear chain, each attached to its neighbor 
with a "spring" of spring constant K as shown in figure. First let's consider 
the problem classically. The Hamiltonian is 

H = Z^ + ^(qi-qm)\ (67) 
i 2m 2 
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where the qgs are the displacements from atomic equilibrium positions. 
Now Hamilton's eqns. (or Newton's 2nd law!) yield 

—Mqj = Mu; 2 q 3 = K(2q 3 - q 3 _ x - q j+1 ). (68) 

A standing sinusoidal wave q 3 = Acos(kaj) satisfies this equation if the 
eigenfrequencies have the form 

wjj = J2(1-cobA;o), (69) 

where if a is the lattice constant, k = 2n/X. Note that for small k, is 
linear, u k ~ (K /M) l / 2 ka. l?> 

This is the classical calculation of the normal modes of oscillation on 
a ID chain. To quantize the theory, let's impose canonical commutation 
relations on the position and momentum of the i\h and jth atoms: 

[qe,Pj] = ihS im (70) 

and construct collective variables which describe the modes themselves 
(recall k is wave vector, t is position) : 

1 1 

qt = -j^yj^ E e lka£ Qk ; Qk = ~^YJ2 E e~ lka£ qe 

Vt = ^^ pk . P, = ^Ee^, (71) 
which leads to canonical commutation relations in wave vector space: 



,m 



= | Ee _ w( ,_ fc0= (?2) 

iV f 

Let's now express the Hamiltonian (67) in terms of the new variables. We 
have, with a little algebra and Eq. (69), 

Hv\ = Y,P k P-k (73) 

i k 

K K M 

^Efe-^-i) 2 = T E^(2-e ik -e- & ) = - X<4QkQ-k 

* £ * k £ k 
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Note since k = 2-Knj (No), the tUk are independent of system size 
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so 



H = ^—Y. PkP-k + E ulQkQ-k 
2M k £ k 



(74) 



Note that the energy is now expressed as the sum of kinetic + potential 
energy of each mode /c, and there is no more explicit reference to the 
motion of the atomic constituents. To second quantize the system, we 
write down creation and annihilation operators for each mode k. Define 



ak = 



a\ — 



(Muj k \ 




V 2h ) 




(Muj k \ 









Qk + 



(Q-k- 



M(j k 
i 



Pi 



(75) 
(76) 



which can be shown, just as in the single SHO case, to obey commutation 
relations 



a k , a\i 



= 5 



kk' 



W,a k ] = 



(77) 
(78) 
(79) 



and the Hamiltonian expressed simply as 

E hw k [a\a k + - j (80) 

which we could have probably guessed if we had realized that since the 
normal modes don't interact, we have simply the energy of all allowed 
harmonic oscillators. Note this is a quantum Hamiltonian, but the energy 
scale in H is the classical mode frequency uj k . 



1.4.3 Debye Model 

Let us imagine using the Hamiltonian (80) as a starting point to calculate 
the specific heat of a (3D) solid due to phonons. We take the Debye model 
for the dispersion to simplify the calculation, 

ck k < kj) 
k> k D 



(81) 
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where the Debye wave vector hp = (Q^n) 1 / 3 is obtained by replacing the 
first Brillouin zone of the solid by a sphere of radius kp which contains 
N wave vectors, with N the number of ions in the crystal. The average 
value of the Hamiltonian is 

U = (H) = 3E^ k (<ak> + I) = 3Efi^ (^31 + I) ( 82 ) 

since the average number of phonons in state k is simply the expectation 
value of a boson number operator 

(a\a k ) = Tr (pa\a k ) = b(hu k ), (83) 

where b(x) = (exp(/3x) — l) -1 is the free Bose distribution function. The 
factors of 3 come from the 3 independent phonon polarizations, which we 
consider to be degenerate here. Taking one derivative wrt temperature, 
the spec, heat per unit volume is 



du 



cv = 



= d hck d he fkD k 3 

n dT k e^ ck - 1 dT 2tt 2 ^ e? hck - 1 



dT 

d 3{k B T) A f0 o x 3 = d 7T 2 {k B T) A = 2tt2 (k B T\ 3 
~ dT27T 2 (hc) 3 ^ e x -l dTIO (he) 3 5 b [he ) [ } 

7T 4 /15 

So far we have done nothing which couldn't have been done easily by or- 
dinary lst-quantized methods. I have reviewed some Solid State I material 
here by way of introduction to problems of interacting particles to which 
you have not been seriously exposed thus far in the condensed matter grad 
sequence. The second quantization method becomes manifestly useful for 
the analysis of interacting systems. I will now sketch the formulation (not 
the solution) of the problem in the case of the phonon-phonon interaction 
in the anharmonic crystal. 
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1.4.4 Anharmonicity & its consequences 

As you will recall from Solid State I, most thermodynamic properties of 
insulators, as well as neutron scattering experiments on most materials, 
can be explained entirely in terms of the harmonic approximation for the 
ions in the crystal, i.e. by assuming a Hamiltonian of the form (80). There 
are some problems with the harmonic theory. First, at higher tempera- 
tures atoms tend to explore the anharmonic parts of the crystal potential 
more and more, so deviations from predictions of the equilibrium theory 
increase. The thermal expansion of solids in the harmonic approxima- 
tion is rigorously zero. 14 Secondly, some important qualitative aspects of 
transport cannot be understood: for example, the harmonic theory pre- 
dicts infinite thermal transport of insulators! (See A&M Ch. 25 for a 
qualitative discussion of these failures). 

The obvious way to go beyond the harmonic approximation is to take 
into account higher-order corrections to the real-space crystal potential 
systematically, expanding 15 

U = \.H D^\l, m)q e q m + i £ D®(£, m, n)q t q m q n + . . . , (86) 

lm O. £mn 



where 

f) n U 

D^ n \£,m 1 ...n) = 



dq£dq m ...dq n 



(87) 



U4=0 



14 This follows from the independence of the phonon energies in harmonic approx. of the system volume, (see 
above) Since pressure depends on temperature only through the volume derivative of the mode freqs. (see A & M p. 
490), 

15 I have dropped polarization indices everywhere in this discussion, so one must be careful to go back and put 
them in for a 2- or 3D crystal. 
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are the so-called dynamical matrices. 16 Using Eqs. (71,76) we find 

Qi = 7*7 (^-) V2 E Q k e ikai = EK + (88) 



AT \2muj) k k 
Note that the product of 3 displacements can be written 

QiQmQn = j^ m E e t{hae+k2am+han) Q h Qk 2 Qk 3 (89) 
so the cubic term may be written 

h 3 = £ v^(hk 2 k 3 )Q kl Q h Q h (90) 

/ci/c 2 /c3 



with 



l/ (3) (/ci/c 2 /c 3 ) = E £> (3) (^ m > ™) exp[i(M + fc 2 ra + fan)} (91) 

fmn 



Note now that the indices m, n run over all unit cells of the crystal 
lattice. Since crystal potential itself is periodic, the momenta &i, fe, and 
ks in the sum are not really independent. In fact, if we shift all the sums 
in (91) by a lattice vector j, we would have 



y (3) (/ci/c 2 /c 3 ) = E D^(£ + j, m + j, n + ^^e^^^+^^+^g^^+^+fcg)^' 

fan 
fan 

where in the last step I used the fact that the crystal potential U in every 
lattice cell is equivalent. Now sum both sides over j and divide by N to 
find 

V^\k l k 2 h) = E ^ (3) (^^,^)exp[2(M+fem + /c 3 n)]A(/ci + /c2 + /c 3 ), 

fan 

(93) 

16 Recall that the theory with only harmonic and cubic terms is actually formally unstable, since arbitrarily large 
displacements can lower the energy by an arbitrary amount. If the cubic terms are treated perturbatively, sensible 
answers normally result. It is usually better to include quartic terms as shown in figure below, however. 
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where 

A(ife) = e ^(fci+fc 2 +^)«j = l-Y,S(k + G) (94) 
iv j TV k 

and G is any reciprocal lattice vector. 

Return now to (90). We have ascertained that V^^.m, n) is zero 
unless k\ + + ks = G, i.e. crystal momentum is conserved. If we 
expand (90), we will get products of 3 creation or annihilation operators 
with coefficients V^ 3 \ The values of these coefficients depend on the elastic 
properties of the solid, and are unimportant for us here. The momenta 
of the three operators must be such that momentum is conserved up to a 
reciprocal lattice vector, e.g. if we consider the term CLki a -k2 a -h we ^ ave 
a contribution only from k\ + ki + k^ = G. 17 Note this term should be 
thought of as corresponding to a physical process wherein a phonon with 
momentum k\ is destroyed and two phonons with momenta — k^ and — h% 
are created. It can be drawn " diagrammatically" a la Feynman (as the 
1st of 2 3rd-order processes in the figure below). 




Figure 4: Diagrams representing phonon-phonon collision processes allowed by energy and momen- 
tum conservation in 3rd and 4th order. 



17 As usual, processes with G = are called normal processes, and those with finite G are called Umklapp processes. 



24 



Questions: 

• How does energy conservation enter? What is importance of processes 
involving destruction or creation of 3 phonons? 

• If one does perturbation theory around harmonic solution, does cubic 
term contribute to thermal averages? 

• Can we calculate thermal expansion with cubic Hamiltonian? 
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Reading: 

1. Ch. 17, Ashcroft & Mermin 

2. Chs. 5& 6, Kittel 

3. For a more detailed discussion of Fermi liquid theory, see G. Baym 
and C. Pethick, Landau Fermi-Liquid Theory : Concepts and Ap- 
plications, Wiley 1991 

2 Electron-electron interactions 

The electronic structure theory of metals, developed in the 1930's by 
Bloch, Bethe, Wilson and others, assumes that electron-electron interac- 
tions can be neglected, and that solid-state physics consists of computing 
and filling the electronic bands based on knowldege of crystal symmetry 
and atomic valence. To a remarkably large extent, this works. In simple 
compounds, whether a system is an insulator or a metal can be deter- 
mined reliably by determining the band filling in a noninteracting cal- 
culation. Band gaps are sometimes difficult to calculate quantitatively, 
but inclusion of simple renormalizations to 1-electron band structure 
known as Hartree-Fock corrections, equivalent to calculating the aver- 
age energy shift of a single electron in the presence of an average density 
determined by all other electrons ("mean field theory"), almost always 

l 



suffices to fix this problem. There are two reasons why we now focus 
our attention on e~ — e~ interactions, and why almost all problems of 
modern condensed matter physics as it relates to metallic systems focus 
on this topic: 

1. Why does the theory work so well in simple systems? This is far 
from clear. In a good metal the average interelectron distance is 
of the order of or smaller than the range of the interaction, e.g. 
the screening length £ scr ~ (cf e^uij^'^ p where p is the density, 
of order lnm for typical parameters. One might therefore expect 
that interactions should strongly modify the picture of free electrons 
commonly used to describe metals. 

2. More complicated systems exhibit dramatic deviations from the pre- 
dictions of band theory. I refer now not simply to large quantitative 
errors in the positions of 1-electron bands, but to qualitative dis- 
crepancies in the behavior of the materials. The most interesting 
modern example is the class of compounds known as the transi- 
tion metal oxides, including the cuprate materials which give rise to 
high-temperature superconductivity (HTS). 1 

lr The parent compounds of the HTS (e.g. La2Cu04 and YBa2Cu304 are without exception antiferromagnetic 
and insulating. Historically, this discovery drove much of the fundamental interest in these materials, since 
standard electronic structure calculations predicted they should be paramagnetic metals. This can easily be seen 
by simple valence counting arguments. The nominal valences in e.g. LaiCuO^ are La 3+ , 2 ~ , and Cu 2+ The 
La and O ions are in closed shell configurations, whereas the Cu is in an [Ar]3d 9 state, i.e. a single d-hole since 
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2.1 Mean field theory (Hartree-Fock) 

Let's begin with the second-quantized form of the electronic Hamiltonian 
with 2-body interactions. The treatment will be similar to that of Kittel 
(Ch. 5), but for comparison I will focus on a translationally invariant 
system and do the calculation in momentum space. In this case as we 
have shown the Hamiltonian is 2 

(1) 
(2) 




k,k ; 



The 2-body interaction V contains 4 Fermi operators c and is there- 
fore not exactly soluble. The goal is to write down an effective 2-body 

there are 10 d electrons in a filled shell. The 1 hole/unit cell would then suggest a 1/2 filled band and therefore 

a metallic state. The crystal field splittings in the planar Cu environment give the 3d x 2_ y 2 state being lowest 

in energy (see later), and since the n.n.'s are the O's, it would seem likely that the lowest O crystal field state 

in the planar environment, the O 3p, will hybridize with it. Sophisticated LDA calculations confirm this general 

picture that the dominant band at the Fermi level is a 1/2-filled planar Cu d x 2_ y 2- O 3p band. Instead, this 

compound is found experimentally to be an electrical insulator and an antifcrromagnet with a Neel temperature 

of 300K! This strongly suggests that the band calculations are missing an important element of the physics of the 

high-T c materials. In the transition metals and TMO's band theory is notoriously bad at calculating band gaps 

in insulators, because of the presence of strong local Coulomb interactions which lead to electronic correlations 

neglected in the theory. It is therefore reasonable to assume that a similar mechanism may be at work here: 

extremely strong Coulomb interactions in the Cu — O planes are actually opening up a gap in the 1/2- filled band 

at the Fermi level and simultaneously creating a magnetic state. This phenomenon has been studied in simpler 

systems since the 60s and is known as the Mott-Hubbard transition. 

2 For the moment, we ignore the ionic degrees of freedom and treat only the electrons. To maintain charge 

neutrality, we therefore assume the electrons move in a neutralizing positive background ("jellium model"). 
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Hamiltonian which takes into account the average effects of the inter- 
actions. We therefore replace the 4-Fermi interaction with a sum of all 
possible 2-body terms, 

c{4c 3 c 4 ~ -(c{c 3 )4c 4 - (4c 4 )c{c 3 + (c{c 4 )4c 3 + (4c 3 )c{c 4 , (4) 

where the + and — signs are dictated by insisting that one factor of -1 
occur for each commutation of two fermion operators required to achieve 
the given ordering. This can be thought of as "mean field terms", in 
the spirit of Pierre Weiss, who replaced the magnetic interaction term 
Si • Sj in a ferromagnet by {Si)Sj = (S)Sj = —H e ffSj, i.e. he replaced 
the field Si by its homogeneous mean value <S, and was left with a 
term equivalent to a 1-body term corresponding to a spin in an external 
field which was soluble. The mean field (S) in the Weiss theory is the 
instantaneous average magnetization of all the other spins except the 
spin Sj, and here we attempt the same thing, albeit somewhat more 
formally. The "mean field" 

(cL c kv) = (cL c k<r)4k'<W = n ka 5 kk '5 aa > (5) 

is the average number of particles n ka in the state k<r, which will be 
weighted with the 2-body interaction V(q) to give the average interac- 
tion due to all other particles (see below). 3 

3 Note we have not allowed for mean fields of the form (c'c^) or (aa). These averages vanish in a normal metal 
due to number conservation, but will be retained in the theory of superconductivity. 
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With these arguments in mind, we use the approximate form (4) and 
replace the interaction V in (3) by 

/s j r _|_ J. _|_ J. 

v H f = o E v(q) [-(4 (7 c k / (T />4, +q(7 ,c k+q(7 - (cJ c , +q(7 ,c k+q(7 >4 (7 c k / (7 /+ 



2 



kk'q 



+ ( C L C k+q ( j) C k / +q ( T /C k / ( 7 / + (4/+qa /C kv) C L C k+q ( 7 

= - E V(q) (cL c k ( 7>c k+q( 7 C k+q ( 7 + ^(0) E (cLc ka )cl a ,c k ! a/ 

kq kk' 

o aa' 

= E f- E n k _ qa V (q) + nV(0)) c ka c ka , (6) 

where the total density n is defined to be n = E ka n ka . Since this is 
now a 1-body term of the form E k(7 XW(k)a k(7 a k(T , it is clear the full 
Hartree-Fock Hamiltonian may be written in terms of a k-dependent 
energy shift 

(h 2 k 2 
v 



Hhf = E 

ka 



2m 



+ E^k) c k(7 c k(7 , (7) 



E/fiKk) = -En k _ qff V(q) + nF(0) (8) 

q > . ' 



Fock Hartree (9) 

Note the Hartree or direct Coulomb term, which represents the aver- 
age interaction energy of the electron kcr with all the other electrons in 
the system, is merely a constant, and as such it can be absorbed into a 
chemical potential. In fact it is seen to be divergent if V(q) represents the 
Coulomb interaction 47re 2 /</ 2 , but this divergence must cancel exactly 
with the constant arising from the sum of the self-energy of the posi- 



tive background and the interaction energy of the electron gas with that 
background. The Fock, or exchange term 5 is a momentum-dependent 
shift. 

2.1.1 Validity of Hartree-Fock theory 

Crucial question: when is such an approximation a good one for an 
interacting system? The answer depends on the range of the interac- 
tion. The HF approximation becomes asymptotically exact for certain 
quantities in the limit of high density Fermi systems if interactions are 
sufhently long-range! This is counterintuitive, for it seems obvious that 
if the particles are further apart on the average, they will interact less 
strongly and less often, so a mean field theory like Hartree-Fock the- 
ory should work well. This is true if the interactions are short-ranged, 
such that for suffiently low densities the particles spend no time within 
an interaction length. The interaction is typically characterized by a 
strength V and a range a, and if the interparticle spacing ro ^> a, the 
particles don't feel the potential much and the ground state energy, for 
example, of such a gas can be expanded at T = in powers of a/r$. 
If a — > 0, the interaction dispapears and the the only characteristic en- 

4 See e.g., Kittcl 

5 The origin of the term exchange is most easily seen in many-body perturbation theory where the Fock term 
is seen to arise from a scattering process where the particle in state kcr changes places with a particle from the 
medium in an intermediate state. 
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ergy left at T = is the zero-point energy, i.e. the energy scale obtained 
by confining the particle to a cage of size the interparticle spacing ro, 
i.e. /i 2 /(2mrg). This is just the Fermi energy in the case of a Fermi 
system. Thus we might expect HF theory to be quite good for a di- 
lute gas of 3 He, since the 3 He- 3 He interaction is certainly short-ranged; 
unfortunately, liquid 3 He has a ~ ro, so corrections to HF are always 
important. 

What about for a metal? We can do a simple argument to convince 
ourselves that HF isn't applicable for low density systems with Coulomb 
interactions. Let's rewrite the basic 2-body Hamiltonian in dimension- 
less variables, 



(h 2 k 2 \ t Iff (^e 2 \ ( , 

H = £ tJ^ + w,£^^m^^ (10) 



e 2 



2aor 



/ \ 

E ^ C la C ka + Y E C ia C b+qa> C kV C k+q<7 
ka ^ k,k',q W / 



11) 



where I've used the system volume V = AT(4/3)7rro, Bohr radius ao = 
h 2 /me 2 , defined r s = ro/ao, and scaled all momenta as k = kr^ 1 , etc. 
Applying dimensional analysis, we expect the two terms T.k 2 c^c and 
E cV(l/g 2 )cc to be of order 1, so that it is clear that the interaction 
term becomes negligible in the limit r s — > (high density). On the 



other hand, it's also clear that had the 1/q 2 factor not been present 
in the interaction, it would have scaled as \/r s instead, and become 
negligible in the low density limit. 

One can do a more careful, formal perturbation analysis for a simple 
quantity like the ground state energy (H) of the electron gas, which is 
discussed in many-body physics texts. The result is 

Eq = Eq f + E corr (12) 

with Eq f the ground state energy calculated in the independent par- 
ticle approximation with the Hartree-Fock energy shifts, and E corr the 
correlation energy, defined to be the part of the g.s. energy not cap- 
tured by HF. As the density increases, so does ef, which represents 
the average kinetic energy per electron; this increase is faster than the 
increase of the correlation energy, as 

K-E. = \e F = 2 -^ Ryd (13) 

' S 

-0 916 

RE.\ HF = Ryd (14) 

r s 

E corr = (0.0622 log r s - 0.096 + . . .) Ryd (15) 

so it is clear the correlation term is less singular than the HF term in the 
limit r s — > 0. At higher densities, particles are effectively independent. 6 

6 For a more detailed discus of these terms, sec Fetter & Wallecka, Quantum Theory of Many-Particle Systems 
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2.1.2 Problem with Hartree-Fock theory 



Although we argued that the Hartree-Fock approximation becomes a 
better approximation in the limit of high density for electrons interacting 
via the Coulomb interaction, it never becomes exact, as one can see by 
examining the Fourier transform of the Hartree-Fock energy shift. The 
Hartree term itself (being k independent) is absorbed into a redefinition 
of the chemical potential, so the shift is (T=0): 

F(x)f 

1 47re 2 
L 6 \k'\<k F |k - k.'\ z 



2e 2 k, 



7T 



-F(k/k F ), (17) 




where F(x) is a function which has a log divergence in slope at x = 1, 
i.e. at the Fermi level. 7 This means while the energy shift might be 
small compared to the Fermi energy ve^, the velocity of an electron is 
de^/ d\i\k F1 which contains a term which is infinite. 



7 The integral is straightforward & useful: 
f kF k 2 dh 

k 



i 1 _ [ kF k2dk r 1 J i _ j_ f kp , , , , 

L3 ^ (k - k') 2 ~ 7o 4tt2 dX fc2 _ fc /2 _ 2kk' X ~ 8^ 2 fc 7o g 



fc-A;' 



1 /fc|-fc 2 



8tt 2 



k 



log 



k,p — k 



2k, 



so with z = fc/fcir and (17) we have 



1 - z 2 


1 + z 


1 


F{z) = — log 


1 - z 


+ 2 



(18) 
(19) 

(20) 
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This problem can be traced back to the long-range nature of the Coulomb 
force. Two electrons at large distances r — r' don't really feel the full 
l/|r — r'| 2 but a "screened" version due to the presence of the inter- 
vening medium, i.e. The electron gas rearranges itself to cancel out the 
long-range part of V . 

2.2 Screening 

2.2.1 Elementary treatment 

To see how this works, let's first examine the problem of a single fixed 
(i.e. infinite mass) charge placed inside an electron gas. We are inter- 
ested in how the electron gas responds to this perturbation (recall in the 
HF approximation the perturbation would simply be the interaction of 
the charge with the uniform gas) . But since the electrons are mobile and 
negatively charged, they will tend to cluster around the test charge if 
its charge is positive, and be repelled (i.e. have a lower density) around 
the test charge if it is negative. In addition, Fermi statistics lead to 
long-range oscillations of the electron charge density far from the test 
charge. 

This is not entirely a "Gedanken experiment". Impurities in solids 
may have different valences from the host, and thus acquire a localized 
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charge, although the entire solid is still neutral. As an example of some 
relevance to modern condensed matter physics, consider what happens 
when you put an extra Oxygen atom in the CuC>2 plane of a cuprate 
high-T c superconducting material. Although the crystal structure of 
these materials is complex, they all contain Cu02 planes as shown in 
the figures. The interstitial oxygen will capture two electrons from the 
valence band, changing it from a 2s 2 2p 4 to a 2s 2 2p 6 electronic configu- 
ration. The impurity will then have a net charge of two electrons. For r 



_ _ 2e _ 



a) b) 

Figure 1: a) The Cu0 2 plane of cuprate superconductor, b) Interstitial charged O impurity. 

near the impurity, the electronic charge density will be reduced by the 
Coulomb interaction. The electronic number density fluctuation is 8 

n(r) » f: F emv) N(co + eS^r))dw » N(w')dw' (21) 

8 A note on signs: the electrostatic potential S<fi(r) < due to the excess negative charge at the impurity. You 
might worry about this: the total change in (f> should include the self-consistently determined positive charge 
screening the impurity. We have fudged this a bit, and the correct separation between induced and external 
quantities will become clear later in the discussion. Note we take e > always, so the electron has charge — e. 
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Note that this equation involves the double-spin density of states, which 
we will use throughout this section. On the other hand, in the bulk far 



around impurity 

CO 



bulk 




Figure 2: Local repulsion of charge due to shift of electronic density of states near impurity 



away from the impurity, 5(/)(r ou ik) = 0, so 



or 



n{r bu i k ) 



Ep+e5(j)(r) 



N(oj)doj 



(22) 



Sn(r) f» J^"'^> N(w)doj - J^ F N(w)doj 
Let us assume a weak potential \eS<p\ <C e F ; therefore 
Sn(r) « N(e F ) [e F + eS(p - e F ] = +eS<pN 



(23) 



(24) 



We can solve for the change in the electrostatic potential by solving 
Poisson's equation. 



V 2 5(j) = -AnSp = AneSn = 47re 2 N 5(f) 



(25) 



Define a new length k T p : the Thomas-Fermi screening length, by k\ F = 
47re 2 iVo, so that P's eqn. is V 2 5<j) = r~ 2 d r r 2 d r 8<j) = k^pdcj), which has 
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the solution S(f)(r) = C exp(— kxFr)/r. Now C may be determined by 
letting N(sf) = 0, so the medium in which the charge is embedded 
becomes vacuum. Then the potential of the charge has to approach 
q/r, so C = q, i.e. in the electron gas 

5<t>(r) = q - , (26) 

r 

where q = — 2e for our example. 

Let's estimate some numbers, by using the free electron gas. Use 

o 

h F = (37TV) 1 / 3 , a = h 2 /(me 2 ) = 0.53A, and N(e F ) = mk F /{h 2 7T 2 ). 
Then we find 

2 a 7T _ a 



TF 4(3 7 r 2 n) 1 /3 ~ 4nV3 



1 (n\- l l Q 



2 



o 



In Cu, for which n ~ 10 cm (and since ao = 0.53 A) 

1 fio 23 ^ -1 ^ 6 ° 



krF « " " n? « 0.5 x 10" 8 cm = 0.5A (28) 

TF 2(0.5 x 10" 8 )- 1/2 V ^ 

Thus, if we add a charged impurity to Cu metal, the effect of the impu- 

o 

rity's ionic potential is exponentially screened away for distances r > |A. 

The screening length in a semiconductor can clearly be considerably 

longer because n is much smaller, but because of the 1/6 power which 

appears in, even with n = 10~ 10 , k FF is only about 150 times larger or 
o 

75A. 
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extended 
bound 



Figure 3: Screened impurity potentials. As the density decreases, the screening length increases, 
so that states which were extended bound. 

What happens in the quantum-mechanical problem of an electron 
moving in the potential created by the impurity when the screening 
length gets long? As shown in the figure, the potential then correspond- 
ingly deepens, and from elementary quantum mechanics we expect it 
to be able to bind more electrons, i.e bound states "pop out" of the 
continuum into orbits bound to the impurity site. 

In a poor metal (e.g., YBCO), in which the valence state is just barely 
unbound, decreasing the number of carriers will increase the screening 
length, since 

k^F ~ n~ 1/6 . (29) 

This will extend the range of the potential, causing it to trap or bind 
more states-making the one free valence state bound. Of course, this 
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has only a small effect 0(1/ N) effect on the bulk electrical properties of 
the material. Now imagine that instead of a single impurity, we have a 
concentrated system of such ions, and suppose that we decrease n (e.g. 
in Si-based semiconductors, this is done by adding acceptor dopants, 
such as B, Al, Ga, etc.). This will in turn, increase the screening length, 
causing some states that were free to become bound, eventually possibly 
causing an abrupt transition from a metal to an insulator. This process 
is believed to explain the metal-insulator transition in some amorphous 
semiconductors. 

2.2.2 Kubo formula 

Many simple measurements on bulk statistical systems can be described 
by applying a small external probe field of some type to the system at 
t = 0, and asking how the system responds. If the field is small enough, 
the response is proportional to the external perturbation, and the pro- 
portionality constant, a linear response coefficient, is given always in 
terms of a correlation function of the system in the equilibrium en- 
semble without the perturbation. In Table 1 I list some standard linear 
response coefficients for condensed matter systems. 
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System 


Perturbation 


Response 


Coefficient 


metal 


electric field E 


current j 


conductivity a 




temp, gradient VT 


heat current jg 


thermal cond. k 




point charge q 


density fluct. Sn 


density correlation 
function Xc 




magnetic field B 


magnetization 
M 


susceptibility Xs 



The general theory of linear response is embodied in the Kubo formula. 
Suppose a perturbation H'(t) is turned on at time t = 0. The idea 
is to express the subsequent time-dependence of expectation values of 
physical observables in terms of matrix elements of the perturbation 
in the unperturbed ensemble. The unperturbed ensemble is described 
by Ho, which may involve interactions, but is usually assumed to be 
time-independent. All observables can be calculated by knowing the 
time evolution of the statistical operator p(t), given by the Heisenberg 
equation 

i^=[H + H',p] (30) 
First one defines a canonical transformation 10 : 

p(t) = S(t)p(t)&] S(t) = e 1 ^ 1 (31) 

9 R. Kubo, J. Phys. Soc. Jpn. 12, 570 (1957). 

10 The idea is to remove the "trivial" time-evolution of p on H, to isolate the dependence on H'. 
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from which one can see by substituting into (30) that 

i % = \H',P\ (32) 
with H' = SH'S^ . This has the formal solution 

The initial condition is p(0) = S(0)p(0)S(0y = p(0), and we can iterate 
the rhs by substituting p(0) for p(0), etc. 11 Since we are interested only 
in leading order, we write 

p(t)-p(0)-iJolH',p(0)}dt'. (34) 

If we now multiply on the left by 5' and on the right by S, we obtain 
the leading order in H' change in the original p: 

pit) ~ p(0) - iSHt){f [H\p(Q)\dt'} S(t), (35) 

which can now be used to construct averages in the usual way. The 
expectation value of an operator O at nonzero T and at some time t is 

(6) = Tr(p(t)6) (36) 
~ Tr (p(0)6) - i J dt'Ti (&(t)[H', p(0)]S(t)6) 
= Tr (p(0)d) - i J dt'Tr (p(0){H>, 6 H (t)]) , (37) 

n Note p(0) is the unperturbed but still interacting density matrix (statistical operator). I reserve the symbol 
po for the nonintcracting analog. 
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where in the last step the cyclic property of the trace and the definition 
of the Heisenberg operator Oh = SOS* was used. The change in the 
expectation value due to the perturbation is then 



5(6) = -ij dt'Tr (p(0)[H', 6„(t)]) (38) 



Since typically H' is proportional to some external (c-number) field, we 
can write H' asH'(t) = J d 3 r£(r)0(r, t) (so H (t) = Jd 3 rB H (r : t)</>(r, t)). 
and 

S(A(1)) = -ififT±(mMl),B H (l%mdl' (39) 
= j™G AB {\A')^\')d\'. (40) 

with the notation 1 = r, t, a, Y = r r , t' , a', I dl f = E a > I d 3 r'dt', etc. 12 
2.2.3 Correlation functions 

As a concrete example, consider the change in the density Sn(r, t) 
of a system in response to a local change in its density perhaps at 
a different place and time 5n(r',t'), where n(r) = {ip*(r)ip(r)) , and 
{ip + (r)ip(r))fr /=Q = no- It's usually convenient to set no = 0, i.e. we 
assume an overall neutral system. The linear response function is then 

X (r, t; r', t 1 ) = -»([ft(r, t),n(r', t')]) 6 , =0 6(t - t'), (41) 

12 The object Tr (J)(0)[Ah (1), -Bff(l')]) which appears on the right-hand side of (38) is related to something 
called a retarded 2-particle Green's function, which I have promised not to oppress you with. 
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where the exectation value is taken in the unperturbed system. The 
function G nn measures the average amplitude in the unperturbed en- 
semble a particle is to be found at r', t' and subsequently at r, t. If the 
particles are statistically independent, % factorizes, (nn') ~ (n}{n f ) = 
(no correlations), so it has the interpretation of a correlation function. 

2.2.4 Dielectric constant 

The simplest way to analyze this problem conceptually is to ask what 
is the potential a long distance from a test charge placed into the inter- 
acting electron gas medium. This potential will be the sum of the bare 
potential produced by the test charge in a vacuum, plus the induced 
potential due to the polarization of the medium. 

Let's use what we learned about linear response in the last section to 
calculate the electric potential in the system </>(r, t) due to an external 
potential <j) ext (r' ', t') (you may think of this as the potential due to a test 
charge if you like). From electrodynamics we recall these two are sup- 
posed to be proportional, and that the proportionality constant is just 
the dielectric constant e. By analogy we write, in Fourier transformed 
language, 

^ f (q,u,)=e(q,u#(q,u,) (42) 
It is the dielectric constant of the medium (which is directly related to 
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the conductivity) which we now wish to calculate. 

The total potential in the material is the sum of the external potential 
and the induced potential, 

cj) = (j) md + cj) ex \ (43) 
but in fact we can calculate the induced potential (j) ind by 

• first finding the induced change in density 8p md due to the external 
potential, using linear response theory; then 

• using Poisson's equation to calculate the induced electric potential. 

The additional term in the Hamiltonian in the presence of the "test 
charge" is 

H' = ej d 3 r n(r)0(r, t) (44) 

where h is the 2nd-quantized density operator The induced change 
in charge density Sp = e(h) is then just given by the Kubo formula as 
above, 13 

<5p w (l) = - ? e 2 / 7dVTr(p(0)[n^(l),n ff (l')])^ t (l')dl' (45) 
ee e 2 / oo X (l,l')r' i (l')dl' (46) 
where the density-density correlation function is 14 

X (l, 1') = -iTv(p(0)[n H (l),n H (l')])9(t - t'). (47) 

13 Don't confuse charge density p with statistical operator p. 

14 Instead of G„n, it's more conventional to use the notation \nn or just \- 
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For a translationally invariant system in Fourier transform rep., the 
convolution (46) reduces to 



V w (q, W )=e 2 x(q jW )^ i (g jW ). (48) 
Now using Poisson's equation q 2 <j) ind = ATi5p md , and Eq. (43), we find 15 

47TP 

<F"(q, ") = — X(q, ")^(q, ") = V(q)x(q, ^ (49) 

so 




(50) 



2.2.5 Lindhard function 

We've found the exact structure of the dielectric response, 16 but not 
really solved it because we still don't know the density-density correla- 
tion function %. For the interacting system this can be a very difficult 
problem, but it is not too hard to find it for the free fermion gas. We 
will need 

15 Compare Eq. (25) 

16 Warning! Compare the treatment here with e.g., A& M pp. 337-340, and you will see that they arrive at the 
apparently different result 

e(q,o;) = l-y(q)x(q,w), (51) 
which is also claimed to be exact. This is because x is defined to be the response to the total potential 

p i " d (q, W )=xU &M (q,w)0(q,a;) (52) 

which is very difficult to calculate since <j> is self-consistently determined. 
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• The definitions of Fourier transforms 

X (q, w) = / d 3 (r - r') / d(t - fje^-^e-^xd , l') 
cl(t) = J £re & - T i>\\) 

c k (t) = / d 3 re- ik 'V(l) (53) 

• the integral expression for the theta function 

v ; J 2ni u' + ir]+ ' v y 

where 7/ + is a positive infinitesimal. 17 

• the solutions of the Heisenberg equation of motion idtOnif) = 
[(9#(t), for cj^t) and Cko-(^) for free fermions (check): 

ct(t) = cle^ (55) 
Ck„{t) = c^e-^' (56) 

• The commutator (check!) 

[ C !l%> C k 3 C h] = ^k 2 k^k x C k 4 — dkik 4 c\ 3 Ck 2 (57) 

• The expectation values in the noninter acting ensemble of the num- 
ber operator at finite temperature T are 

(n k > = /(<*), (58) 
where / is the Fermi function. 

17 Thc sign of the infinitesimal is important. It determines in which complex half- plane the contour can be 
completed to evaluate the integral by Cauchy's theorem. Check it! 
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Then (supressing spin indices) 

([n H (l),n H [l'}) = Ti{p Q [n H {\),n H [\' 
= E e^ k - k ^- r e^ k 3- k 4 )-^[4(t)c k2 (t),c[ 3 (Oc k4 (tO]) 

ki...k 4 

= E ^ (kl - k2 > r ^ (k3 - k4 > r, ^ ([ek 3- ek 4] t,+ [ e ki-%]^([4 iCk2 , c k3 c k4 ]> 

ki...k 4 

= ^ e i(ki.[r-r / ]+k 2 -[r / -r]) e i(e k2 -6 kl )(* / -t)^ k ^ 
kik 2 

_ ^ ^(k4-[r-r / ]+k 3 -[r'-r]) e ,( ek3 - ek4 )(t / -t)^ k ^ ^ 
k 3 k 4 

so that (using (47) and (53), and putting back the spin sum) 

(60) 



xo(q, ^) = E 77 , x ]_ . 

k<r U - (f k+q - f k ) + IT] 



This is the so-called Lindhard function, the charge susceptibility of the 
free Fermi gas. Note that in the free gas the major contributions to the 
integral (60) come from contributions where e k+g — e k — uj = 0. These 
correspond to 2-particle "particle-hole excitations" where the Fermi sea 
is given a little energy uj which promotes a particle from below the 
Fermi surface (say k) to above (say k + q) . In the homework you will 
investigate when such excitations are possible within this model. 

The use of the free gas susceptibility Xo f° r X m Eq. (50) is a rather 
poor approximation, as it completely ignores the effect of the long-range 
Coulomb interaction on the intermediate states. The self-consistent field 
method, or random phase approximation, replaces the induced charge 
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without interactions with the screened charge: 



Solving (50) for e gives 



And And\ 

p p \v=o/c Xo 



v=o 



^ext 



(61) 



e(q, to) ~ 1 - V(q) X o(q, w) (RPA/SCF) (62) 



This is sometimes called the Lindhard approx. to the dielectric function. 
2.2.6 Thomas- Fermi theory 

Let us consider the static limit of the theory we currently have. This 
will allow us to answer the original question we posed: if we insert a 
charge somewhere into a metal, what is the net electrostatic potential 
felt at some distance r away by a second charge? Taking the uj — ■> 
limit in (60), we find 

Xo(q, 0) = £ /(£k+q) 'I (£k i (63) 

First note that if we further take the limit q — > we find (recall ATo is 
double-spin dos!) 

Xo - -2 E ^ ^ -N J df k - AT / df k J(f k ) = -7V . 

T«Te? (64) 
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The dielectric constant in the long- wavelength (i.e. small q) limit may 
thus be written 

e(q, 0)~1 + ^N = 1 + %, (65) 
q A q A 

where kxF is the same Thomas- Fermi wave vector k TF = 47re 2 A^ we 
discussed earlier. Now we go back: if we had placed a charge e in a 
vacuum, it would have produced a potential <jf xt (r) = e/r, or <j) ext (q) = 
4ne/q 2 everywhere in space. From (42) we see that the actual potential 
in the medium is 



m = rt)/ e( ,) = T i^ ? = (66) 

i.e. the Thomas-Fermi wavevector cuts off the divergence of the bare 
long-range Coulomb interaction. In real space 

~^ (67) 

so the long-range interaction is indeed replaced by an exponentially cut 
off short-range one, with screening length scale k^ F . 

2.2.7 Friedel oscillations 

The Thomas-Fermi theory is very intuitive and describes realistically 
how the singularities of the long-range Coulomb interaction are cut off by 
screening. It suffices to eliminate the anomalies encountered in Hartree- 
Fock theory of transport and density of states near the Fermi level. But 
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because it is a long-wavelength approximation, it cannot describe ade- 
quately the response of the electron gas to a short-range perturbation. 
Our initial example of a point charge placed in an electron gas was just 
such a case. To get an accurate description of the screened impurity 
potential, the finite-^ dependence of the charge susceptibility must be 
accounted for. This calculation involves simply replacing our Thomas- 
Fermi dielectric function etf = 1 — k^ F /q 2 with the expression involving 
the full Lindhard function 



i 2 

ELindhard = 1 " V(q) X o(q, W = 0) = 1 - "^(^)> (68) 



where F is the same function as in (20). Then we repeat the same 
calculation for the impurity potential 

*M = £ ^ T (q) - 7^^> (69) 
q y — k TFj?( q \ (2 + x z ) z r 6 

where x = /ctf/(2/cf)- The main interesting feature of this result is 
of course the fact that the potential no longer has the simple Yukawa 
form with exponential decay, but in fact an oscillatory 1/r 3 behavior. 
The last step in (69) is a bit long, 18 but I have attempted to give you 
a flavor of it in the homework. One of the main conclusions of the 
exercise there is that the potential is very sensitive to the details of the 
charge susceptibility at 2fcp, which the Thomas-Fermi theory clearly 

18 See e.g. Fetter & Wallecka 
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gets wrong. Ultimately Friedel oscillations (and RKKY oscillations, 
their counterpart for magnetic spins) originate from the sharpness of 
the Fermi surface. 

2.2.8 Plasmons 

An examination of the dielectric function of the solid, e.g. Eq. (50), 
shows that the dielectric function can be zero, meaning the system re- 
sponds very strongly to a small perturbation (see Eq. (42). These are 
the excitation energies of the electron gas. In the random phase approx- 
imation (62) this means there are excitations when 

V(q)xo(q,«) = l (70) 

Consider first a finite size box where the energy levels e k are discrete. 
This means Xo(q?^) ls a sum of terms containing simple poles at the 
particle- hole excitation energies u; q (k) = e k + q — e k - Xo will change sign 
and diverge as uj passes through these energies, therefore there will be 
a solution of (70) for each excitation energy. It is simplest to focus on 
the limit at T = 0, where 

E ( I I 

k<k F [UJ - (£ k+q - f k ) + IT] (J - (£ k - £ k+q ) + IT] 

\k+q\>kp 

In other words, the poles in the complex ui plane are particle-hole ex- 
citation energies ±6Jk(q) = £k+q — £k, where |k + q| is always above 
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Fermi surface and k always below. 19 The situation is depicted in figure 
4. At T = 0, there is a maximum frequency above which there are no 

(q) X,(q,(0) 



*°max W pl 




Figure 4: Eigenfrequencies of electron gas in RPA. 



more particle-hole excitations possible for given fixed q 1 i.e. the largest 
excitation energy occurs when k = kp and k, q are parallel, so that for 
a parabolic band, €k = /c 2 /(2m), 

(72) 



UJ 



max 



kFq + q 



m 



2m 



19 To arrive at this result, note from (60) 

/(£k) - /(&+q) 



Xo(q,w) = 

k 

- E 

k 

- E 

k 

= E 

k 

- E 

k 

= E 



W - (6c+q - £k) + «?7 



/(&)(i - /(a +q )) - /(a +q )(i - /(a)) 





- (6c+q - 


&) + ^ 


/(&)(!- 




/(e k+q )(i-/(Ck)) 1 


W - (Ck+q 


- fk) + ir? 


^-(Ck+q-?k)+»f?J 


/(&)(1- 




/(&)(l-/(£k-q)) 1 


W - (£k+q 


- £k) + 27? 


w - (Ck - £k-q) + i»? J 


/(&)(!- 


KM) 


/K-k)(l-/K-k-q)) 


W - (£k+q 


- fk) + ir? 


W - (£-k - £-k-q) + if] 


/(&)(!- 




/(a)(i-/(Ck +q )) l 



k <. ^ - (Ck+q - Ck) + ir? w - (£ k - £k+ q ) + ir? , 

1 



lim 



^/(a)(i-/(^+q)) 

k 

- E 



i 



k+q >k f 



u - (£k+q - £k) + ir? w - (£ k - Ck+q) + ir? 
1 1 



w - (Ck+q - £k) + ir? w - (£k - Ck+q) + i?7 
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There is another possible root of e, as also shown in Fig. (4). In the 
limit uj > (jJ max , we may expand (71) for small q, and find 

2q 2 



x(q,^) - 



E 

mu^ k<k F 
k F q 2 
3tt 2 



2k q /k q\ 2 

1 + - + 3 — - + 



muj 



5 \muj , 



(73) 



Now we look for a solution to ReFxo = 1 and find to leading order 20 



uj = (J p i(l + 0(q/k F y + 



(74) 



with 



/47me 2 \ 



m 



1/2 



(75) 



which is of course the plasma frequency you found by elementary meth- 
ods last semester. In contrast to the particle- hole excitations, which 
involve exciting only 1 particle from its position in the Fermi sea, a 
plasma oscillation perforce involves motions of all particles and is there- 
fore referred to as a collective mode of the system. Plasmons have been 
observed indirectly in metals, primarily in thin metallic films. 



2.3 Fermi liquid theory 

In a good metal the average interelectron distance is of the order of 
or smaller than the range of the interaction, e.g. the screening length 

20 Note this is a result for 3 dimensions. It is a useful exercise to obtain the analagous result for 2D, which has 
w ~ q 1 / 2 , i-e a gapless mode. 
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Figure 5: Excitations (poles of e(q, oj) x ) at T = 0. 

kjp « 0.5 (n/al) , where n is the density, of order 0.1-lnm for typ- 
ical parameters. One might therefore expect that interactions should 
strongly modify the picture of free electrons commonly used to describe 
metals. It turns out, however, that for T <C Tp the important de- 
grees of freedom are mostly frozen out, and only a fraction T/Tp of 
all electrons near the Fermi surface participate in interaction processes. 
This leads to the possibility of a mapping between the free Fermi gas 
and the interacting Fermi system. IF the elementary excitations of the 
many-body system can be obtained from the single-particle states of the 
free electron gas by turning on the interaction adiabatically (and if the 
ground state has the full symmetry of the Hamiltonian), the distribution 
function of the interacting system would be the same as that of a 
free gas, and the particle and hole excitations of the gas would evolve 
into "quasiparticle" and "quasihole" states. 
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Fermi gas Fermi liquid 

a) 



K k 
b) F 



k n k 

c) F 



Figure 6: a) Evolution of quasiparticle states; b) T = Quasiparticle distribution function n°; 
c) T = true electron distribution function (c k Ck). 

2.3.1 Particles and holes 

To understand what this really means, let's go back and review the 
definitions of the terms particles and holes. Let's take N free Fermions, 
mass m in a volume L 3 , where we know that the eigenstates are Slater 
determinants of N different single particle states, each given by 



1 



Mr) = 



ik-r 



(76) 



Let's stick to T = for the moment, and recall that the occupation of 
each state is given by = 6{k — ftp), where kp is the radius ermi sea. 

Now let's add one "test" particle to the lowest available state k = kp- 
We then have for the chemical potential at T = 0, 

dE k\ 



fi = E (N + 1) - E (N) = 



dN 2m 



(77) 



Particles and holes are excitations of the filled Fermi sea. Excitations of 
the gas consist of taking a certain number of particles and moving them 
to the other side of the Fermi surface, yielding particles above and an 
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equal number of holes below the Fermi surface. These excitations can 
be described in terms of the changes in the particle occupation numbers 



5n p = n p — rip-. 



Sn k = 



78) 



5k ft for a particle k' > kp 
—S^k' fo r a hole k' < kp 
At nonzero temperatures, a thermal fluctuation will create changes in 



particle ex. 




N(E) 



Figure 7: Particle and hole excitations of the Fermi gas. 

occupation numbers Sn p ~ 1 only for excitations of energy within ksT 
of ep. Note that the total energy of the free gas is completely specified 
as a functional of the occupation numbers: 



E - E = T.w-(n k - n\) = £— Sn 



k 2m 



k 2m 



(79) 



Now take the gas and put it in contact with a particle reservoir. 
The appropriate thermodynamic potential is then the Gibbs free energy, 



32 



which for T = is F = E — fiN, and 

F - F = £ - n) Sn k = T,^ k Sn k . (80) 

The free energy of one particle, with momentum k and occupation 
dn^ri = #k,k' is and it corresponds to an excitation outside the Fermi 
surface. The free energy of a hole Sny = — S^y is — ^k, which cor- 
responds to an excitation below the Fermi surface. For both species 
then the free energy of an excitation is |^| , which is always positive; 
ie., the Fermi gas is stable with respect to excitations as it must be. 
Quasiparticles and quasiholes in interacting system 

Consider a system with interacting particles an average spacing a 
apart, so that the typical interaction energy is ^e~ a ^ r ' TF . As discussed 
above, we now suppose that we can create this system by turning on 
the interactions adiabatically starting from the free gas in a time t, such 
that the system evolves while staying always in its ground state. 

If a state of the ideal system is characterized by the distribution n^, 
then the state of the interacting system will evolve adiabatically from n£ 
to rik- In fact if the system remains in its ground state, then = rik, i.e. 
the occupation of each state labelled by k won't change (although the 
energy will). This is an assumption, and we will be studying some coun- 
terexamples to this ansatz (superconductivity and magnetism). But in 
general Landau expected this to hold if the symmetry of the interacting 
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system was the same as the noninteracting one, and this appears to be 
the case for simple metals. 

Let's follow what happens to a particle of momentum k added to 
the top of the non-interacting Fermi gas when we slowly turn the inter- 
action knob. As U increases, two things happen. The k state of the 






t = 
11=0 



turn on interactions 



t = t 

U= — e TF 
a 



Figure 8: Evolution of particle in state k on top of Fermi sea as interaction is turned on. Particle 
becomes "dressed" by interactions with the rest of the system which changes the effective mass 
but not the momentum of the excitation ("quasiparticle"). 

original particle is no longer an eigenstate of 7Y, so it decays into the 
new excitations of the system, which according to our ansatz are also 
single-particle-like. Nearby particles also decay into the original state; 
this mixing has the effect of "dressing" the original free particle with 
a "cloud" of excitations of the surrounding system. However since mo- 
mentum is conserved, the net excitation (called a quasiparticle)) can 
still be labelled with original momentum k. In the same way, holes of 



34 



momentum k below the Fermi surface evolve into quasiholes. Occasion- 
ally an analogy from classical mechanics is useful: we think of a bullet 
being fired into a large container filled with cotton. It will drag some 
cotton with it, and create a "particle+cloud" with increased mass but 
the same momentum. 

It is too much to hope that the interactions between quasiparticles 
will vanish altogether. The iV-particle ground state wave- function may 
still be Slater-determinant like, but this doesn't necessarily imply that 
the quasiparticles are infinitely long-lived (exact) eigenstates of the sys- 
tem. So if each quasiparticle state k has lifetime Tk, we have to worry 
a bit about the adiabatic switching on procedure if Tk < t. If so, the 
quasiparticle will decay before we finish switching, and we won't be able 
to go reversibly back and forth. If we decrease t too much so again 
Tk ^> t, then the switching on of U may not be adiabatic, i.e. we may 
knock the system into a state other than its ground state. It was Lan- 
dau's great insight to recognize that these objections are overcome for 
quasiparticle states sufficiently close to the Fermi energy. In this case 
the Pauli principle strongly restricts the decay channels for a quasipar- 
ticle, and one can argue (see below) that the lifetime becomes infinite 
at the Fermi energy itself. Therefore for any time t there should be a 
narrow range of energies near the Fermi surface for which the Tk is long 



35 



enough to make the whole construction work. 



2.3.2 Energy of quasiparticles. 

As in the free gas, excitations will be characterized only by the deviation 
of the occupation number from the ground state occupation n k 21 

Sn k = n k - n k . (81) 

At sumently low temperatures Sn k ~ 1 only for those excitations near 
the Fermi surface where the condition r k ^> t is satisfied. Therefore we 
should be able to formulate a theory which depends only on 8n kl not on 
n k or n k , which are not well defined for k far from the Fermi surface. n k 
and n k should never enter independently and are not physically relevant. 

For the noninteracting system the dependence of the energy (T = 0) 
E on 8n k is very simple: 

k 2 

E - E = E — £n k . (82) 
k 2m 

In the interacting case E[n k ] is more complicated generally, but if we 
take 5rik to be small (just a few excitations above the ground state) 
then we may expand: 

E[n k ] = E + E e k Sn k + 0(Sn 2 k ) , (83) 
k 

21 Note we will now take over the notation n£ which in the previous discussion meant the distribution function 
of the noninteracting system. Now it means the ground state of the interacting system. They are the same by 
Landau's ansatz. 
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where we now define ek = SE/dn^. If Sn^ = #k,k', then E ^ Eq + e^; 
i.e., the energy of the quasiparticle with momentum k 7 is e^. 

As discussed, we will only need €k near the Fermi surface where Sn^ 
is 0(1). So we may expand ek around the Fermi level, 

e k «/i+(k-k F ) • V^e^ (84) 

where V^€k = the group velocity of the quasiparticle. We no longer 
know the exact form of ek, but some exact relations must be obeyed 
because our system obeys certain symmetries. If we explicitly put the 
spin-dependence back in, we have symmetries 

ek,o- = e-k,-^ time-reversal (85) 
tk,a = e_ k ,a parity (86) 

So unless we explicitly break T-symmetry by e.g. applying a magnetic 
field, we have e^ a = e_k, a = tk-a- Furthermore, as usual €k has to 
respect the point and space group symmetries of the system, e.g. for 
an isotropic system €k depends only upon the magnitude of k, |k|, so 
k and = Vek(|k|) = k(dek/dk) are parallel. 22 Define (m*) -1 as the 
constant of proportionality at the Fermi surface 23 

Vk F = k F /m* (87) 

22 For a cubic system ek is invariant under 90° rotations, etc. 

23 Note kF is not changed by interactions (Luttinger's theorem for isotropic system) 
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Then for the interacting system at the Fermi surface 



m*k F 



(88) 



interacting 



the same form as for the noninteracting system. Note the m* (usually 
but not always > m) accounts for the dressing of the quasiparticle 
particle (bullet + cotton!). In other words, as far as the density of levels 
at the Fermi energy is concerned, the only thing which changes is the 
effective mass of the fermionic single-particle excitations. 

2.3.3 Residual quasiparticle interactions 

Let's go back to the Gibbs free energy of our system of quasiparticles 
characterized in some generalized excited state by distribution function 
Sn k : 





where again Fq, Eq etc. characterize the ground state. Thus, 



N-N = T,Sn k 



(90) 



p 



and since 



E - E « £ e k Sn k + G{5n 2 ) 



(91) 



k 



we get 



F - F « E (e k - //) Sn k = £ £ k 8n k + G(Sn 2 ) . (92) 



k k 
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As shown in Fig. 9, we will be interested in excitations of the system 
which deform the Fermi surface by an amount proportional to 5. For 
example, along the positive k x axis, our excited system has Sn^ = 1 
for some states k above /cp, whereas along the negative k x axis some 
states have Sn^ = — 1. The approximation to the free energy above will 




Figure 9: Small deformations of the Fermi sphere, proportional to S. 

remain valid provided 

-^£|<fak|<l. (93) 

If we stick with our original criterion of using only states near the Fermi 
surface to construct our distortion, the £k for those states with Sn^ ^ 
will also be of order 5. Thus, 

E^ k -^ 2 ), (94) 

k 
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so in order to keep terms of order S 2 consistently we must also add the 
next term in the Taylor series expansion of the energy : 




95 



where 



fk,k' — 



SE 



(96) 



The term proportional to fay, was proposed by L.D. Landau. It de- 
scribes the residual interactions of the quasiparticle excitations which 
remain even after they are "dressed" . A glance at the dimensions of the 
new term shows that fay ~ l/V . But if fay is an interaction between 
quasiparticles, each of which (in plane-wave like state) is delocalized over 
the whole volume V, so the probability that two will interact is roughly 
~ kjp/V. Therefore 

' " (97) 



fay ~ k T 3 F /y 



We can proceed as earlier with the quasiparticle energy ek and ask how 
this unknown interaction function reflects the underlying symmetries of 
the physical system. To this end we reintroduce the hidden spin degrees 
of freedom. First in the absence of an external magnetic field, the system 
should be invariant under time- reversal, so 



/ktr,kV — f-k-a -k'-a' : 



(98) 
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and, in a system with reflection symmetry (parity) 

/ktr,kV = f-ka-k'a' • (99) 

Then since this implies 

fka,k'a' = /k-^k'-a' • (100) 

it must be that / depends only upon the relative orientations of the 
spins d and a 1 ', so there are only two independent components for given 
k, k 7 , e.g. /kT,k'T an( ^ /kt,k'|- It 18 more conventional to decompose / 
into spin- symmetric and antisymmetric parts: 

fk,k' = 2 (At,k'T _ /kt,k'|) /k,k' = 2 (At,k'T + /kt,k'|) ■ ( 101 ) 

/k,k' ma y be interpreted as an exchange interaction, or 

/k7,kV = /k,k' + °" * °"7k,k' ( 102 ) 

where cr and cr' are the Pauli matrices for the spins. 

The next important assumption entering Fermi liquid theory is that 
the quasiparticle interactions are short-ranged. This means that f^y is 
essentially constant as a function of \k\, \k'\, and so depends only on the 
directions k,k r , f^y — f(k,k'): 



fky — fk,k> 



e k =e k /= A t 



(103) 
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or equivalent ly, only on the angle 9 between them! 

oo 

/ k V = E/fA(cos0). (104) 

1=0 

Conventionally these f parameters are expressed in terms of dimension- 
less quantities. 

VN ff = ~^f? = F t a . (105) 

We expect that the Landau parameters F/' a will be negligible for suf- 
ficiently large so we have now achieved our goal of subsuming the 
complicated effects of interactions in a few phenomenological parame- 
ters. 

2.3.4 Local energy of a quasiparticle 

Now consider an interacting system with a certain distribution of excited 
quasiparticles Sny. Let's add another particle, say of momentum k 
and see how the energy changes. The distribution function goes from 
(Sn'y. — > Sn' k + ^k,k')- From Eq. 95 the free energy of the additional 
quasiparticle is 

ik = £k + E fy,kSny , (106) 
k' 

(since /k',k = /k,k')- The second term on the right corresponds to the 
average interaction of a quasiparticle with the other quasiparticles in the 
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system. 

Substituting (104) and (105) in (106) gives the shift in energy of a 
quasiparticle due to the Landau interaction 

= 6 - & = wl F ° 5n + aF o 5«» + w F rt ' » + -I- ( 107 ) 

where Sn = Eka S^ka is the density shift, 8n s = Eka crSrika is the spin 
density shift, and g = Eka kSrika is the number current. The first 
two terms can be thought of as molecular field couplings representing 
the average effect of density and magnetization fluctuations in the sur- 
rounding medium on the quasiparticle energy. The third term is the 
famous "backflow" term which describes the effect of the quasiparticle 
streaming through the medium to which it couples. The flow pattern 
is of dipolar form, as depicted in Figure 10. Note that the Landau pa- 
rameter F( is not completely independent, as it can be shown that in a 
Galilean invariant system it is directly related to the effective mass by 
the relation m*/m = 1 + F{/3 (see below). In a crystal this relation 
does not hold. 25 

Before leaving this section, let me just mention that the importance 
of the local quasiparticle energy £ or rather the energy shift 5£ is 

24 The structure may remind you of Hartree-Fock theory, to which it is similar, but keep in mind / is the residual 

interaction between quasiparticles, not electrons 

25 For an extensive discussion of this and many other aspects of FL theory I've glossed over, see G. Baym and 

C. Pethick, Landau Fermi-Liquid Theory : Concepts and Applications, Wiley 1991 
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Figure 10: Schematic of backflow around quasiparticle of momentum k 

that when one writes down how external forces couple to the system to 
determine, e.g. transport properties, it is the gradient of |k which is the 
force a qp of momentum k experiences due to spatial inhomogeneities 
in the system. There is a "Landau molecular field" proportional to 

-V^k = -VE/A(r) (108) 
k' 

which is responsible, e.g. for spin waves in dilute gases. Eq. (108) is a 
simplified version of Landau's famous kinetic equation for a Fermi liquid 
which is the analog of the Boltzman equation for a weakly interacting 
gas. 

2.3.5 Thermodynamic properties 

The expression (107) for the quasiparticle energies and the fact that the 
equilibrium quasiparticle distribution function is just the Fermi function 
enables immediate calculations of thermodynamic observables in the 
Fermi liquid. Because the states of the gas and liquid are in one to one 
correspondence, for example, one immediately knows that the expression 
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for the entropy of the liquid is the same as that for a free Fermi gas, 

S = - EKa log Tito + (1 - n ka ) log(l - n ka )\. (109) 
/co- 
Note now that the distribution function nt a in local equilibrium must 
be a Fermi function evaluated at energy e ka given by the bare energy 
plus the expression (106) in general. However, we are interested in 
calculating a quantity in global equilibrium, in which case the shifts 
Sn, 5n s ,g, ... vanish. In this case the calculation reduces to the usual 
one for a free Fermi gas. For temperatures T < Tj? we replace the 
sum Eka by No J dt; . The temperature can then be scaled out, and 
the remaining integral performed, yielding the entropy density at low 
temperatures, 

s = jW T, (110) 

and the specific heat at constant volume is therefore C = T(ds/dT)y = 
(ra*fcp/3)T, where in the last step we have used Nq = m*/cp/7r 2 . The 
result for the liquid is identical to that for the gas, with the exception 
of the replacement of m by m*. 

Similarly, we may derive an expression for the spin susceptibility of 
the Fermi liquid by constructing the magnetization 8n s as above, and 
noting that the shift in the distribution function is due to the Landau 

26 Recall the form of the Fermi distribution function for the free gas is obtained purely from combinatorial 
considerations, not from any knowledge of the energies. 
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shift in the quasiparticle energy (molecular field) as well as the external 
field H: 

5n ka = ^^{Se k(X - ho<j H) . (Ill) 

Using 8n s = Eka&Snka, and noting that since adn/de is isotropic and 
odd in a, only the i = 0, spin-antisymmetric part of 8ek a contributes to 

the sum, we find for the spin susceptibility Xo = l^o(dn s /dH) 

_ fi 2 N _ m*/m 
X ~ (1 + FS) " l + F aXo ' [ } 

where Xo is the susceptibility of the free gas. 



2.3.6 Quasiparticle relaxation time and transport proper- 
ties. 

Calculations of transport properties require introducing the quasiparti- 
cle lifetime. Recall the quasiparticle states are not eigenstates of the 
many-body Hamiltonian, due to the weak residual interactions among 
them. We therefore expect a quasiparticle with momentum k to decay 
except when it sits exactly at the Fermi surface and when T = 0, when 
it can easily be seen there is no phase space for scattering which con- 

— * 

serves energy and momentum. But relaxing these conditions (k not on 
FS, T ^ 0) allows phase space for scattering and hence a finite lifetime. 
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Since the interactions among quasiparticles are assumed weak, we may 
use a golden rule type formula for the scattering rate" 




Figure 11: 2-quasiparticle collision: states before (1,2) and after (3,4) 

Let's consider a quasiparticle with energy £1 = e\ — fi above the Fermi 
level. A scattering process takes it to final state £3, and to conserve en- 
ergy and momentum a quasiparticle with energy £2 must be scattered to 
final state £4. I've assumed the interaction V is independent of momen- 
tum, frequency, etc. because the most important effects we are looking 
for are purely statistical, unless V is very singular. 27 The "golden rule" 
then gives 

1 = 2tt J d&Nfa) J d&N{&) J dUN(u)\v\ 2 f(&)(i - - + 6 - 6 - ^ 

~ 27riV 3 |K 2 | / ?1 ^3 / ?1 ^ 3 ^4 = vriV 3 |K| 2 e i 2 , 
Jo Jo 

27 In the case of the Coulomb interaction, which is very singular, screening takes care of this problem. See 
section 2.2.6 
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where the delta function conserves energy and the Fermi factors ensure 
that there is a particle in state 2 and that states 3 and 4 are empty 
beforehand. For simplicity I evaluated the Fermi functions at T=0, 
giving the final simple result. Note I have neglected spin for the moment, 
but this is not essential. 28 Note now that for NoV <^ 1 (weak residual 
interactions) and No ~ it is clear that for £ ef the scattering 

satisfies the condition 

- « e (H3) 

This means that the quasiparticle states are well-defined near the Fermi 
surface, since their damping is small compared to the energy of the 
state. ( As expected, the relaxation rate exactly at the Fermi surface 
vanishes identically.) We therefore expect that Fermi liquid theory cal- 
culations in terms of a weakly interacting quasiparticle gas will be valid 
for frequencies and temperatures such that lj, T £f- 

Two further comments about the scattering rate are of potential im- 
portance. First, I give without proof the full result at finite tempera- 

28 What I have done which is not really Kosher, however, is to neglect the constraints placed on the scattering 
by momentum conservation. It turns out in three dimensions this is not terribly important, and one obtains the 
correct answer. It is a useful exercise to convince yourself, for example, that in 2D the relaxation rate goes like 
£ 2 log£. 
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tures: 



W) = U l{nT)2+e] ' (115) 

so that at finite T one finds that 1/r varies as T 2 or £ 2 , whichever is 
larger. Certain simple transport properties in a Fermi liquid reflect this 
directly. For example, the resistivity due to electron-electron scattering 
in a metal goes as T 2 at the lowest temperatures; it is usually masked 
by other, larger sources of scattering in ordinary metals, however. 30 The 
second remark is that if one were to find, for some microscopic model, 
that the relaxation rate was larger than the quasiparticle energy, the 
quasiparticles would be overdamped and the entire concept would be 
useless. One would then speak of an incoherent or "non- Fermi liquid" 

29 0ur simple calculation has to be modified to account for momentum conservation, expressed as 

k!-k 3 =k 4 -k 2 . (114) 

Since £1 and £2 are confined to a narrow shell around the Fermi surface, so too are £3 and £4. This can be seen 
in Fig. 11, where the requirement that ki k 3 = kj — k 2 limits the allowed states for particles 3 and 4. Taking 
ki fixed, allowed momenta for 2 and 3 are obtained by rotating ki — k 3 = k 4 k 2 ; this rotation is constrained, 
however, by the fact that particles cannot scatter into occupied states. This restriction on the final states reduces 
the scattering probability by a factor of £i/sf, but is the same restriction contained in the calculation above. 
Thus momentum conservation doesn't affect our T = estimate above, except to modify the prefactor. 

If these hand- waving arguments don't satisfy you (they shouldn't), see AGD or Fetter/Wallecka. 
30 Of course this is not quite right. The electron-electron interactions in a solid conserve crysal momentum, and 

thus the conductivity of a pure crystal would be infinite at any temperature unless one explicitly accounts for 

Umklapp processes. Thus it is not true for the electrical conductivity that a oc r, with r the 1-particle lifetime 

due to electron-electron interactions. However the Umklapp processes do also provide a transport rate varying 

as T 2 , so one often hears the not quite correct remark that the scattering rate in a Fermi liquid varies as T 2 , 

therefore so does the resistivity p. 
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system. If, on the other hand, one found that 1/r were to scale precisely 
with max(£, T) (instead of max(£ 2 , T 2 )), the quasiparticles would be in 
a certain sense critically damped, i.e. not quite well defined, but not 
completely washed out either. This is what is frequently referred to as 
a "marginal Fermi liquid" .[MFL] 

2.3.7 Effective mass m* of quasiparticles 

Here I give a quick derivation of the relation between the Landau pa- 
rameter F\s which appears in the current and the effective mass m*. 
The net momentum of the volume V of quasiparticles is 

P qp = 2\/^knk, net quasiparticle momentum (116) 

k 

also the total momentum of the Fermi liquid. Since the number of 
particles equals the number of quasiparticles, however, the quasiparticle 
and particle currents must also be equal 

J qp = J p = Vk^k net quasiparticle and particle current (117) 

k 

or, since the momentum is just the particle mass times this current 

P p = 21/m^Vknk net quasiparticle and particle momentum (118) 
k 

where v p = V^6k, is the velocity of the quasiparticle. So 

Ekn k = mEV fc 6 k n k (119) 
k k 
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Let's consider an arbitrary variation of rik, and remember that €k de- 
pends on nk, so that 

5e k = 2V Y. fk,k>Sn k > . (120) 

k 

For Eq. 119, this means that 

Ek£n k = raEVfce k £nk (121) 
k k 

+m2V EE V k {hySny) n k , 

k k' 

or integrating by parts (and renaming k — > k' in the last part), we get 

E-^k = EV^k^k (122) 
k rn k 

-2V / E Sn k f^yVyny , 
k,k' 

The usual variational argument says that since dn^ is arbitrary, we must 
have that the integrands are equal 

- = V k e k - 2V E / k ,k' Vk'rik' (123) 
m k ' 

At T = Vk'rik' = —k'5(k' — kp). The integral may now be evaluated 

by using the system isotropy in k space, and choosing k parallel z. As 

we mostly concerned with Fermi surface properties we take k = kp, and 

let 6 be the angle between k (or the z-axis) and k', and finally note that 

on the Fermi surface \ Vk^k\k=k F = vp = kp/m* . Thus, 

kp kp f k f2 dkdQ p - . N ,^ nA . 

— = —^+2V ———fek • k'5(k' - k F ) 124 
mm* J {27in) 6 
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However, since both k and k 7 are restricted to the Fermi surface k' = 
cos#, and evaluating the integral over /c, we get 



1 1 f d£l „ 



(125) 



m m* J (27r/i) 3 ' 

If we now sum over (hidden) spins only the symmetric part of / appears 
1 1 AWk F 



+ 



d(cos6>) f s {6) cos6>, 



(126) 



m m* (27r/i) 3 
Now use decomposition (104), orthonormality of Legendre polynomials, 
and def. of Landau parameters: 



r(9) = 

J dxP n (x)P m (x)dx = 
Noft = 



I 

2 r 
-0 r 



'nm 



2n + 1 

Vm*k F _ 

~3~Jl — r l J 



7T 



(127) 
(128) 

(129) 

(130) 



we find that 



1 



1 Ff 
— + 1 



m rrr 



3 ' 



(131) 



or m*/m = 1 + Ff/3. 

So effective mass and F\ s aren't independent, at least in a Galilean 
invariant system where (117) holds. 
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3 Quantum Magnetism 

The main purpose of this section is to introduce you to ordered 
magnetic states in solids and their "spin wave-like" elementary 
excitations. Magnetism is an enormous field, and reviewing it 
entirely is beyond the scope of this course. 



3i Introduction 

3.1.1 Atomic magnetic Hamiltonian 

The simplest magnetic systems to consider are insulators where 
electron-electron interactions are weak. If this is the case, the 
magnetic response of the solid to an applied field is given by the 
sum of the susceptibilities of the individual atoms. The mag- 
netic susceptibility is defined by the the 2nd derivative of the free 
energy, 1 

d 2 F 

X = - M 2- (1) 

We would like to know if one can understand (on the basis of an 
understanding of atomic structure) why some systems (e.g. some 

1 In this section I choose units such that the system volume V — 1. 
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elements which are insulators) are paramagnetic (% > 0) and 
some diamagnetic (x < 0). 

The one-body Hamiltonian for the motion of the electrons in the 
nuclear Coulomb potential in the presence of the field is 



Hatom 



1 ^ 

2^ ? (Pi + ^A(r,)) + E V(n) + • S, (2) 



where Ef S$ is the total spin operator, //_g = e/(2mc) is the Bohr 
magneton, go ~ 2 is the gyromagnetic ratio, and A = — |r x H 
is the vector potential corresponding to the applied uniform field, 
assumed to point in the z direction. Expanding the kinetic energy, 
H atom may now be expressed in terms of the orbital magnetic 
moment operator L = E« r x as 

H atom = E 7T- + E V(n) + SH, (3) 

i ZTTl i 

SH = /*b(L + 3o S) • H + ^H 2 Ux 2 + yf). (4) 

Given a set of exact eigenstates for the atomic Hamiltonian in 
zero field |n) (ignore degeneracies for simplicity), standard per- 
turbation theory in SH gives 

SE n = H • (n\p\n) + £ I ^ ' ^ ~ + H*? + tiMtf) 

where p = //b(L + poS). It is easy to see that the first term 
dominates and is of order the cyclotron frequency uj c = eH/ (mc) 
unless it vanishes for symmetry reasons. The third term is of order 
u c / (e 2 /ao) smaller, because typical electron orbits are confined to 
atomic sizes ao, and is important in insulators only when the state 
\n) has zero magnetic moment (L = S = 0). Since the coefficient 
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of H 2 is manifestly positive, the susceptibility 2 in the /i = 
ground state |0) is % = —d 2 5Eo/dH 2 , which is clearly < 0, i.e. 
diamagnetic. 3 

In most cases, however, the atomic shells are partially filled, and 
the ground state is determined by minimizing the atomic en- 
ergy together with the intraatomic Coulomb interaction needed 
to break certain degeneracies, i.e. by "Hund's rules" 4 Once this 
ground state (in particlular the 5, L, and J quantum numbers is 
known, the atomic susceptibility can be calculated. The simplest 
case is again the J = case, where the first term in (5) again 
vanishes. Then the second term and third term compete, and 
can result in either a diamagnetic or paramagnetic susceptibility. 
The 2nd term is called the Van Vleck term in the energy. It is 
paramagnetic since its contribution to % is 

d 2 E \ 2n d term n 2 ^ K !^!™')! 2 ^ n , a \ 

Xw = Qjp- = £ > 0. (6) 

3.1.2 Curie Law-free spins 

In the more usual case of J ^ 0, the ground state is 2 J + 1 
degenerate, and we have to be more careful about defining the 
susceptibility. The free energy as T — > can no longer be replaced 
by Eq as we did above. We have to account for the entropy of the 
2 J + 1 degenerate spin states as well. Applying a magnetic field 
breaks this degeneracy, so we have a small statistical calculation 

2 If the ground state is nondegenerate, we can replace F = E—TS in the definition of the susceptibility 

by the ground state energy Eq. 

3 This weak diamagnctism in insulators with filled shells is called Larmor diamagnetism 

4 See A&M or any serious quantum mechanics book. I'm not going to lecture on this but ask you 

about it on homework. 
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to do. The energies of the "spin" in a field are given by 

H = -jl-H, (7) 

and since p = — 7 J within the subspace of definite J 2 , 5 the 2 J + 1 
degeneracy in H = is completely broken. The free energy is 



F = -ThgZ = -Thg £ e^ HJz 



= -Thg 



Jz=~J 

^HiJ+1/2) _ e -/3 7 tf(J+l/2) 



(8) 



e PlH/2 _ e -hH/2 

so the magnetization of the free spins is 

M = ~ = 1 JB{j3 1 JH), (9) 

where B(x) is the Brillouin function 

^ 2J+1 2J+1 1 1 
B(x) = -jj— coth —^j— x - coth 7Tj x - ( 10 ) 

Note I defined 7 = Hb9- We were particularly interested in the 
H —> case, so as to compare with ions with filled shells or 
J = 0; in this case one expands for T » jH, i.e. cothx ~ 
l/x + x/3 + . . ., B(x) ~ ( J + l)x/(3J) to find the susceptibility 



d 2 F _^ 7 2 J(J+1) 



(11) 



i.e. a CWze /au> for the high-temperature susceptibility. A \/T 
susceptibility at high T is generally taken as evidence for free 
paramagnetic spins, and the size of the moment given by /j 2 = 
7 2 J(J+1). 

5 This is not obvious at first sight. The magnetic moment operator jl oc L + goS is not proportional 
to the total angular momentum operator J = L + S. However its matrix elements within the subspace 
of definite L, S, J are proportional, due to the Wigner-Eckart theorem. One therefore may assume the 
proportionality within the subspace, where the proportionality const. 7 = g(J, L, S)jiB- 9 is called 
Lande g-factor, and is independent of J z according to Wigner-Eckart. See e.g. Ashcroft & Mermin p. 
654. 
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3.1.3 Magnetic interactions 



The most important interactions between magnetic moments in 
an insulator are electrostatic and inherently quantum mechanical 
in nature. From a classical perspective, one might expect two 
such moments to interact via the classical dipolar force, giving 
a potential energy of configuration of two moments pi and p2 
separated by a distance r of 

jj = Pi • P2 - ^{Pi • r){fii • r) 

^3 

Putting in typical atomic moments of order \ib = eh/mc and 
distances of order a Bohr radius r ~ ao = h 2 /me 2 , we find 
U ~ 10~ A eV, which is very small compared to typical atomic 
energies of order eV . Quantum-mechanical exchange is almost 
aways a much larger effect, and the dipolar interactions are there- 
fore normally neglected in the discussions of magnetic interac- 
tions in solids. Exchange arises because, e.g., two quantum me- 
chanical spins 1/2 (in isolation) can be in either a spin triplet 
(total 5 = 1), or singlet (total 5 = 0). The spatial part of 
the two-particle wavefunctions is antisymmetric for triplet and 
symmetric for singlet, respectively. Since the particles carrying 
the spins are also charged, there will be a large energetic dif- 
ference between the two spin configurations due to the different 
size of the Coulomb matrix elements (the "amount of time the 
two particles spend close to each other") in the two cases. b In 
terms of hypothetical atomic wave functions ^ a ( r ) and ^(r) 
for the two particles, the singlet and triplet combinations are 
^o(ri,r 2 ) = ^ a (ri)^(r 2 ) ± ^ a (r 2 )^&(ri), so the singlet-triplet 

6 Imagine moving 2 H-atoms together starting from infinite separation. Initially the 3 S = 1 states 
and 15 = states must be degenerate. As the particles start to interact via the Coulomb force at very 
long range, there will be a splitting between singlet and triplet. 
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splitting is approximately 



-J = E Q -E l = (0\H\0) - (1\H\1) 

where 1/ represents the Coulomb interactions between the parti- 
cles (and possible other particles in the problem). 7 

We'd now like to write down a simple Hamiltonian for the spins 
which contains the physics of this exchange splitting. This was 
done first by Heisenberg (check history here), who suggested 

H2-spin = J&1 ' $2 (15) 

You can easily calculate that the energy of the triplet state in this 
Hamiltonian is J/4, and that of the singlet state —3 J/4. So the 
splitting is indeed J. Note that the sign of J in the H2 case is 
positive, meaning the 5 = state is favored; the interaction is 
then said to be antiferromagnetic, meaning it favors antialigning 
the spins with each other. 8 

The so-called Heitler- London model of exchange just reviewed 
presented works reasonably well for well-separated molecules, but 
for N atoms in a real solid, magnetic interactions are much more 
complicated, and it is in general not sufficient to restrict one's 
consideration to the 4-state subspace (singlet 3 components of 

7 For example, in the H molecule, 

p2 2 2 2 

y(n,r 2 ) = + - e - e (14) 

|ri-r 2 | |Ri-R 2 | |ri-Ri| |r 2 -R 2 | 

where Ri and R 2 are the sites of the protons. Note the argument above would suggest naively that the 
triplet state should be the ground state in the well-separated ion limit, because the Coulomb interaction 
is minimized in the spatially antisymmetric case. However the true ground state of the i? 2 molecule is 
the Heitler-London singlet state V«( r i> r 2) ~ ^(ri)^^) + V , o( r 2)V , b( r i) 

historically the sign convention for J was the opposite; J > was usually taken to be ferromagnetic, 
i.e. the Hamiltonian was defined with another minus sign. I use the more popular convention H = 
JJ2St-Sj. Be careful! 
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triplet) to calculate the effective exchange. In many cases, par- 
ticularly when the magnetic ions are reasonably well-separated, it 
nevertheless turns out to be ok to simply extend the 2-spin form 
(15) to the entire lattice: 

H=JT,S i -S i+s (16) 

id 

where J is the exchange constant, i runs over sites and S runs 
over nearest neighbors. 9 This is the so-called Heisenberg model. 
The sign of J can be either antiferromagnetic ( J > in this con- 
vention), or ferromagnetic (J < 0). This may lead, at sufficiently 
low temperature, to a quantum ordered state with ferromagnetic 
or antiferromagnetic- type order. Or, it may not. A good deal de- 
pends on the type and dimensionality of the lattice, as we discuss 
below. 

1 1 1 1 1 1 tmu 
1 1 1 1 1 1 it 1 1 it 

1 1 1 1 1 1 tititi 
1 1 1 1 1 1 u it it 

a) b) 

Figure 1: a) Ferromagnetic ordering; b) antiferromagnetic ordering. 



Although oversimplified, the Heisenberg model is still very difficult 
to solve. Fortunately, a good deal has been learned about it, and 

9 One has to be a bit careful about the counting. J is defined conventionally such that there is one 
term in the Hamiltonian for each bond between two sites. Therefore if i runs over all sites, one should 
have 5 only run over, e.g. for the simple cubic lattice, +x, +y, and +z. If it ran over all nearest 
neighbors, bonds would be double-counted and we would have to multiply by 1/2. 
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once one has put in the work it turns out to describe magnetic 
ordering and magnetic correlations rather well for a wide class of 
solids, provided one is willing to treat J as a phenomenological 
parameter to be determined from a fit to experiment. 

The simplest thing one can do to take interactions between spins 
into account is to ask, "What is the average exchange felt by 
a given spin due to all the other spins?" This is the essence of 
the molecular field theory or mean field theory due to Weiss. 
Split off from the Hamiltonian all terms connecting any spins to a 
specific spin Sj. For the nearest-neighbor exchange model we are 
considering, this means just look at the nearest neighbors. This 
part of the Hamiltonian is 



6 Hi = Si 



J E Si+5 

5 



-/7 H, (17) 



where we have included the coupling of the spin in question to the 
external field as well. We see that the 1-spin Hamiltonian looks 
like the Hamiltonian for a spin in an effective field, 

SHt = -Pi-Heff, (18) 

He// = £S, + , (19) 

Note this "effective field" as currently defined is a complicated 
operator, depending on the neighboring spin operators Sj +( $. The 
mean field theory replaces this quantity with its thermal average 

H e// = H-— E(S, + s> (20) 

= H " W™> (21) 
where the magnetization M = (/i) and z is the number of nearest 
neighbors again. But since we have an effective one-body Hamil- 
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tonian, thermal averages are supposed to be computed just as 
in the noninteracting system, cf. (9), but in the ensemble with 
effective magnetic field. Therefore the magnetization is 

M = 1 SB{p 1 SH eff ) = 1 SB [p-)S[H - ^M]j . (22) 

This is now a nonlinear equation for M, which we can solve for 
any H and T. It should describe a ferromagnet with finite spon- 
taneous (H —> 0) magnetization below a critical temperature T c . 
So to search for T c , set H = and expand B(x) for small x: 



M = - lS B (%M) * -,S S -£ £ M (2 3) 

=, T c = ^^(-J), (24) 

So the critical temperature in this mean field theory (unlike the 
BCS mean field theory!) is of order the fundamental interaction 
energy \J\. We expect this value to be an upper bound to the true 
critical temperature, which will be supressed by spin fluctuations 
about the mean field used in the calculation. 

Below T c , we can calculate how the magnetization varies near the 
transition by expanding the Brillouin fctn. to one higher power 
in x. The result is 

M - (T - T c ) 1/2 . (25) 

Note this exponent 1/2 is characteristic of the disappearance of 
the order parameter near the transition of any mean field theory 
(Landau) . 
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Figure 2: Plot of Mathematica solution to eqn. (22) for M vs. T using — J — g — 1; z — 
4;S— 1/2. T c —1 for this choice. Upper curve: H — 0.1; lower curve: H = 0. 



3.2 Ising model 

The Ising model 10 consists of a set of spins Si with ^-components 
only localized on lattice sites i interacting via nearest-neighbor 
exchange J < 0: 



Note it is an inherently classical model, since all spin commuta- 
tors vanish [Si, Sj] = 0. Its historical importance consisted not so 
much in its applicability to real ferromagnetic systems as in the 
role its solutions, particularly the analytical solution of the 2D 
model published by Onsager in 1944, played in elucidating the na- 
ture of phase transitions. Onsager's solution demonstrated clearly 
that all approximation methods and series expansions heretofore 
used to attack the ferromagnetic transition failed in the critical 
regime, and thereby highlighted the fundamental nature of the 
problem of critical phenomena, not solved (by Wilson and others) 
until in the early 70's. 



10 Thc "Ising model" was developed as a model for ferromagnetism by Wilhclm Lcnz and his student 
Ernst Ising in the early '20's. 



Iri = J SiSj — 2(j,bH Si. 



(26) 
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3.2.1 Phase transition/critical point 

We will be interested in properties of the model (26) at low and at 
high temperatures. What we will find is that there is a tempera- 
ture T c below which the system magnetizes spontaneously, just as 
in mean field theory, but that its true value is in general smaller 
than that given by mean field theory due to the importance of 
thermal fluctuations. Qualitatively, the phase diagram looks like 
this: 



H 




Figure 3: Field-magnetization curves for three cases. Mq is spontaneous magnetization 
in ferromagnetic phase T <T C . 

Below T c , the system magnetizes spontaneously even for field 
H — ► 0. Instead of investigating the Onsager solution in detail, 
I will rely on the Monte Carlo simulation of the model developed 
by Jim Sethna and Bob Silsbee as part of the Solid State Sim- 
ulation (SSS) project. The idea is as follows. We would like to 
minimize F = T log Tr exp —f3H for a given temperature and ap- 
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plied field. Finding the configuration of Ising spins which does so 
is a complicated task, but we can imagine starting the system at 
high temperatures, where all configurations are equally likely, and 
cooling to the desired temperature T. 11 Along the way, we allow 
the system to relax by "sweeping" through all spins in the finite 
size lattice, and deciding in the next Monte Carlo "time" step 
whether the spin will be up or down. Up and down are weighted 
by a Boltzman probability factor 

±fiH e ff/T 

P(S, = ±1/2) = e ^ Heff/T + e , Heff/T , (27) 

where Hf^ is the effective field defined in (19). The simulation 
picks a spin Si in the next time step randomly, but weighted with 
these up and down probabilities. A single "sweep" (time step) 
consists of L x L such attempts to flip spins, where L is the size 
of the square sample. Periodic boundary conditions are assumed, 
and the spin configuration is displayed, with one color for up and 
one for down. 

Here are some remarks on Ising critical phenomena, some of 
which you can check yourself with the simulation: 

• At high temperatures one recovers the expected Curie law 

• The susceptibility diverges at a critical temperature below 
the mean field value. 12 Near, but not too near, the transition 
X has the Curie- Weiss form % ~ (T — T c ) -1 . 

• With very careful application of the simulation, one should 
obtain Onsager's result that very near the transition ("critical 

This procedure is called simulated annealing 
12 This is given as a homework problem. Note the value of J used in the simulation is 1/4 that defined 
here, since S"s are ±1. 
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regime" ) 

M ~ (T c - Tf, (28) 
with /3 = 1/8. The susceptibility actually varies as 

X~|T-T c |-t, (29) 

with 7 = 7/4. Other physical quantities also diverge near the 
transition, e.g. the specific heat varies as \T — T c \^ a 1 with 
a = (log divergence) . 

There is no real singularity in any physical quantity so long 
as the system size remains finite. 

The critical exponents a, f3, 7... get closer to their mean field 
values ((3 = 1/2, a =, 7 =,... ) as the number of nearest 
neighbors in the lattice increases, or if the dimensionality of 
the system increases. 

The mean square size of themal magnetization fluctuations 
gets very large close to the transition ( "Critical opalescence" , 
so named for the increased scattering of light near the liquid- 
solid critical point) . 

Magnetization relaxation gets very long near the transition 
( "Critical slowing down" ) . 

In ID there is no finite temperature phase transition, although 
mean field theory predicts one. See next section. 
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3.2.2 ID solution by transfer matrix 



3.2.3 Ferromagnetic domains 

3.3 Ferromagnetic magnons 

Let's consider the simplest example of an insulating ferromagnet, 
described by the ferromagnetic Heisenberg Hamiltonian 



where J < is the ferromagnetic exchange constant, i runs over 
sites and 5 runs over nearest neighbors, and Hq is the magnetic 
field pointing in the z direction. It is clear that the system can 
minimize its energy by having all the spins S align along the z 
direction at T = 0; i.e. the quantum ground state is identical to 
the classical ground state. Finding the elementary excitations of 
the quantum many-body system is not so easy, however, due to 
the fact that the spin operators do not commute. 

3.3.1 Holstein-Primakoff transformation 

One can attempt to transform the spin problem to a more stan- 
dard many-body interacting problem by replacing the spins with 
boson creation and annihilation operators. This can be done ex- 
actly by the Holstein-Primakoff transformation 13 



(30) 




Si 






(2S) 1/2 1 



{2S) l ' 2 a\ 1 



V 



V 




(32) 



(31) 



T. Holstein and H. Primakoff, Phys. Rev. 58, 1098 (1940). 
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Verify for yourselves that these definitions Sf give the correct 

A A A 

commutation relations [S x , S y ] = iS z z/the bosonic commutation 
relations [a, a)] = 1 are obeyed on a given lattice site. Note also 
that the operators which commute with the Hamiltonian are 5 

A 

and S z as usual, so we can classify all states in terms of their 
eigenvalues 5(5+1) and S z . To complete the algebra we need a 

A 

representation for S Zl which can be obtained by using the identity 
(on a given site i) 

S 2 Z = 5(5 + 1) - l - (5+5" + 5"5+) . (33) 

Using (32) and some tedious applications of the bosonic commu- 
tation relations, we find 

S Z = S- a) a. (34) 

Now since the system is periodic, we are looking for excitations 
which can be characterized by a well-defined momentum (crystal 
momentum) k, so we define the Fourier transformed variables 

a^^Ee-^&k + = ^I+ k - x '&L (35) 

where as usual the F.T. variables also satisfy the bosonic relations 
[frk? frk'] = <W> etc Looking forward a bit, we will see that the 
operators and 6k create and destroy a magnon or spin-wave 
excitation of the ferromagnet. These turn out to be excitations 
where the spins locally deviate only a small amount from their 
ground state values (|| z) as the "spin wave" passes by. This sug- 
gests a posteriori that an expansion in the spin deviations aja^ 
(see (34)) may converge quickly. Holstein and Primakoff therefore 
suggested expanding the nasty square roots in (32), giving 
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(25) 



1/2 



a. 






V 



45 



+ . 



/25\ 1/2 



Si * 



(25) 



1/2 



k 

aj - 



ikR, 



»k -t^ £ e'( k - k '- k ")' R 46 k ,6 k , + 



"45 



457V 



+ . 



•(3,6) 



k,k',k" 



/25\ 1/2 



E e ik M - -L- e e i ( k+k '- k ")- R ^[6Uk" + 



457V 



k,k',k" 



4 = 5 - aja, = 5 - ^ £ e^-^H^k, 

7V kk' 



C37) 
(38) 



Note that the expansion is formally an expansion in 1/5, so 
we might expect it to converge rapidly in the case of a large-spin 
system. 14 The last equation is exact, not approximate, and it is 
useful to note that the total spin of the system along the magnetic 
field is 

S z ,tot = E S z = NS - -i E &t&k, (39) 

i N k 

consistent with our picture of magnons tipping the spin away from 
its T = equilibrium direction along the applied field. 



3.3.2 Linear spin wave theory 

The idea now is to keep initially only the bilinear terms in the 
magnon operators, leaving us with a soluble Hamiltonian, hoping 
that we can then treat the 4th-order and higher terms pertur- 



14 For spin-1/2, the case of greatest interest, however, it is far from obvious that this uncontrolled 
approximation makes any sense, despite the words we have said about spin deviations being small. Why 
should they be? Yet empirically linear spin wave theory works very well in 3D, and surprisingly well in 
2D. 
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batively. 15 Simply substitute (36)- (38) into (30), and collect the 
terms first proportional to 5 2 , 5, 1, 1/5, etc. We find 

H = \jNzS 2 - 2fi B H S + HT gnan + 0(1), (40) 

where z is the number of nearest neighbors (e.g. 6 for simple cubic 
lattice), and 



Tjmagnon _ ° ^ \ - 

~ ~N ^ 



e-^ k - k '> R ^ k '- s b k bl + e^- k '> R *e- lk '- s bib k , 



-e^'^bibv - e^ k - k '>^ + %tbJ + ^1 E ^(k-k')-R^ 
= JzS £ Wb\A + 7-kfe - + 2fi B H £ 



k 



= £ [-2.7*5(1 - 7 k) + 2/z B #o] fe, 
k 



where 



7fc = -Ee iH (42) 

is the magnon dispersion function, which in this approximation 
depends only on the positions of the nearest neighbor spins. Note 
in the last step of (41), I assumed 7k = 7-k, which is true for lat- 
tices with inversion symmetry. For example, for the simple cubic 
lattice in 3D with lattice constant a, 7k = (cos k x a + cos k y a + 
cos k z a)/3, clearly an even function of k. Under these assump- 
tions, the magnon part of the Hamiltonian is remarkably sim- 
ple, and can be written like a harmonic oscillator or phonon-type 
Hamiltonian, n™ a v non — EkW^k, where = frj^k is the num- 
ber of magnons in state k, and 

u k = -JSz{\ - 7k ) + 2fi B H (43) 

15 physically these "nonlinear spin wave" terms represent the interactions of magnons, and resemble 
closely terms representing interactions of phonons in anharmonic lattice theory 
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is the magnon dispersion. The most important magnons will be 
those with momenta close to the center of the Brillouin zone, 
k rsj 0, so we need to examine the small- k dispersion function. For 
a Bravais lattice, like simple cubic, this expansion gives 1 — 7k — 
/c 2 , 16 i.e. the magnon dispersion vanishes as k — > 0. For more 
complicated lattices, there will be solutions with —> const. 
There is always a "gapless mode" —> as well, however, since 
the existence of such a mode is guaranteed by the Goldstone the- 
orem. 17 The figure shows a simple ID schematic of a spin wave 




TTT77T 

Figure 4: Real space picture of spin deviations in magnon. Top: ordered ground state 

with wavelength A = 2n/k corresponding to about 10 lattice sites. 
The picture is supposed to convey the fact that the spin devia- 
tions from the ordered state are small, and vary slightly from site 
to site. Quantum mechanically, the wave function for the spin 
wave state contains at each site a small amplitude for the spin 
to be in a state with definite S x and/or S y . This can be seen by 
inverting Eq. (37) to leading order, & noting that the spin wave 
creation operator 6 k lowers the spin with S~ = S x — iS y with 
phase e~ tk ' Rt and amplitude ~ 1/5 at each site i. 

16 Check for simple cubic! 

17 For every spontaneously broken continuous symmetry of the Hamiltonian there is a Wk^o = mode. 
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3.3.3 Dynamical Susceptibility 



Experimental interlude 

The simple spin wave calculations described above (and below) 
are uncontrolled for spin- 1/2 systems, and it would be nice to 
know to what extent one can trust them. In recent years, numeri- 
cal work (exact diagonalization and quantum Monte Carlo) tech- 
niques have shown, as noted, that linear spin wave calculations 
compare suprisingly well with such "exact" results for the Heisen- 
berg model. But we still need to know if there are any physical 
systems whose effective spin Hamiltonian can really be described 
by Heisenberg models. In addition, keep in mind that the utility 
of spin wave theory was recognized long before such numerical 
calculations were available, mainly through comparison with ex- 
periments on simple magnets. The most useful probe of magnetic 
structure is slow neutron scattering, a technique developed in the 
40's by Brockhouse and Schull (Nobel prize 1994). This section is 
a brief discussion of how one can use neutron scattering techniques 
to determine the dispersion and lifetimes of magnetic excitations 
in solids. 18 

Neutrons scatter from solids primarily due to the nuclear strong 
force, which leads to nonmagnetic neutron-ion scattering and al- 
lows structural determinations very similar to x-ray diffraction 
analysis. In addition, slow neutrons traversing a crystal can emit 
or absorb phonons, so the inelastic neutron cross-section is also 
a sensitive measure of the dispersion of the collective modes of 
the ionic system. 19 There is also a force on the neutron due to 

18 A complete discussion is found in Lovesey, Theory of Neutron Scattering from Condensed Matter, 
Oxford 1984, V. 2 
19 cf. Ashcroft & Mermin ch. 24 
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the interaction of its small magnetic dipole moment with the spin 
magnetic moment of the electrons in the solid. There are there- 
fore additional contributions to the peaks in the elastic neutron 
scattering intensity (at the Bragg angles) corresponding to mag- 
netic scattering if the solid has long-range magnetic order; they 
can be distinguished from the nonmagnetic scattering because the 
additional spectral weight is strongly temperature dependent and 
disappears above the critical temperature, or through application 
of an external magnetic field. Furthermore, in analogy to the 
phonon case, inelastic neutron scattering experiments on ferro- 
magnets yield peaks corresponding to processes where the neu- 
tron absorbs or emits a spin wave excitation. Thus the dispersion 
relation for the magnons can be mapped out. 20 

I will not go through the derivation of the inelastic scattering cross 
section, which can be found in Lovesey's book. It is similar to the 
elementary derivation given by Ashcroft & Mermin in Appendix 
N for phonons. The result is 

(^) roe r^{^ (q) l 2e " W(q)(1+6M) 

-N 

x— 7? £(<W - q a qp) Im Xapfa ~^)(44) 

where ao is the Bohr radius, k and k 7 are initial and final wave 
vector, q = k — k', F(q) atomic form factor, e~ 2W ^ the Debye- 
Waller factor, and b(u) the Bose distribution function, N the 
number of unit cells, and u is the energy change of the neuton, 

20 Even in systems without long range magnetic order, neutron experiments provide important infor- 
mation on the correlation length and lifetime of spin fluctuations. In strongly correlated systems (e.g. 
magnets just above their critical temperature, or itinerant magnets close to a magnetic transition as in 
Sec. xxx), these can be thought of as collective modes with finite lifetime and decay length. This means 
the correlation function (Sf (f)S") is not const, as t, |Rj — Rj| — ► oo, but may fall off very slowly, as 
"power laws" t-f 3 , |R, - Rj|" 7 . 
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k 2 / (2m) — k' 2 1 (2m). The physics we are interested in is contained 
in the imaginary part of the dynamic susceptibility %(q, uj). For 
uj < 0, this measures the energy loss by neutrons as they slow 
down while emitting spin waves in the solid; for u > the neu- 
trons are picking up energy from thermally excited spin waves. 

By spin rotational invariance, all components \ zz = Xxx = Xyy 
are equal. It is then most convenient to calculate — within linear 
spin wave theory — the transverse spin susceptibility 

X?_(R* - R,, t) = -Tr (p[S+(t), S"]) 0(t) (45) 

and then its Fourier transform wrt momentum q and frequency 
uj. I won't do this calculation explicitly, but leave it as an exercise. 
Note it follows exactly the method used to calculate the charge 
susceptibility on p. 22 of the previous section. You express the S 
operators in terms of the bkS, whose time dependence is exactly 
known since the approximate Hamiltonian is quadratic. At the 
end, after Fourier transforming, one recovers 



2S. 



N 



1 



x + -(q,^) = (—) - — : ^- • (46) 



\uj + uj q + i0 + J 



Again, as for the Fermi gas, we see that the collective modes of 
the system are reflected as the poles in the appropriate response 
function. 

The final cross section is now proportional to 

Im x+_(q, -uj) ~5(uj + uj q ), (47) 

i.e. there is a peak when a magnon is absorbed (neutron's en- 
ergy k' 2 /{2m) is larger than initial energy k 2 /(2m) =^> u = 
k 2 /(2m) — k' 2 /(2m) < 0.). There is another similar contribution 
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proportional to 5{uj — (emission) coming from X-+- Thus the 
dispersion uj q can be mapped out by careful measurement. 



Figure 5: Neutron scattering data on ferromagnet. I searched a bit but couldn't come 
up with any more modern data than this. This is Fig. 1, Ch. 4 of Kittel, Magnon 
dispersions in magnetite from inelastic neutron scattering by Brockhouse (Nobel Prize 
1994) and Watanabe. 



The one-magnon lines are of course broadened by magnon- 
magnon interactions, and by finite temperatures (Debye- Waller 
factor). There are also multimagnon absorption and emission 
processes which contribute in higher order. 

3.4 Quantum antiferromagnet 

Antiferromagnetic systems are generally approached by analogy 
with ferromagnetic systems, assuming that the system can be di- 
vided up into two or more sublattices, i.e. infinite interpenetrating 
subsets of the lattice whose union is the entire lattice. Classically, 
it is frequently clear that if we choose all spins on a given judi- 
ciously chosen sublattice to be aligned with one another, we must 
achieve a minimum in the energy. For example, for the classical 
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AF Heisenberg model H = J S j • with J > on a square 
lattice, choosing the A-B sublattices in the figure and making all 
spins up on one and down on another allows each bond to achieve 
its lowest energy of —JS 2 . This state, with alternating up and 
down spins, is referred to as the classical Need state. Similarly, it 
may be clear to you that on the triangular lattice the classical low- 
est energy configuration is achieved when spins are placed at 120° 
with respect to one another on the sublattices A,B,C. However, 

® © ® © 

© ® © ® square lattice 

® © ® © 

© ® © 

triangular ® © © ® 
lattice 

© ® © 
® © © ® 

Figure 6: Possible choice of sublattices for antiferro magnet 

quantum magnetic systems are not quite so simple. Consider the 
magnetization Ma on a given sublattice (say the A sites in the 
figure) of the square lattice; alternatively one can define the stag- 
gered magnetization as M s = 1)* (S^) (Note (— l) 1 means +1 
on the A sites and —1 on the B sites.) Either construct can be 
used as the order parameter for an antiferromagnet on a bipartite 
lattice. In the classical Need state, these is simply Ma = NS/2 
and M s = NS, respectively, i.e. the sublattice or staggered mag- 
netization are saturated. In the wave function for the ground 
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state of a quantum Heisenberg antiferromagnet, however, there is 
some amplitude for spins to be nipped on a a given sublattice, due 
to the fact that for a given bond the system can lower its energy 
by taking advantage of the S x S f x + S y S' y terms. This effect can 
be seen already by examining the two- spin 1/2 case for the ferro- 
magnet and antiferromagnet. For the ferromagnet, the classical 
energy is — | J\S 2 = — | J|/4, but the quantum energy in the total 
spin 1 state is also — | J|/4. For the antiferromagnet, the classical 
energy is —JS 2 = — J/4, but the energy in the total spin quan- 
tum mechanical state is —3 J/4. So quantum fluctuations-which 
inevitably depress the magnetization on a given sublattice-lower 
the energy in the antiferromagnetic case. This can be illustrated 
in a very simple calculation of magnons in the antiferromagnet 
following our previous discussion in section 7.2. 

3.4.1 Antiferromagnetic magnons 

We will follow the same procedure for the ferromagnet on each 
sublattice A and B, defining 




A (48) 



Bi (50) 



(49) 



(51) 
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^tz — S — A\Ai 

>Fz = q - p] 



~Sfz ~ ^ — B, Bi, 



(52) 
(53) 



i.e. we assume that in the absence of any quantum fluctuations 
spins on sublattice A are up and those on B are down. Otherwise 
the formalism on each sublattice is identical to what we did for 
the ferromagnet. We introduce on each sublattice creation & 
annihilation operators for spin waves with momentum k: 

1 . „-l,.T5 . •)• 1 
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(54) 
• (55) 



In principle k takes values only in the 1st magnetic Brillouin 
zone, or half-zone, since the periodicity of the sublattices is twice 
that of the underlying lattice. The spin operators on a given site 
are then expanded as 
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The expansion of the Heisenberg Hamiltonian in terms of these 
variables is now (compare (40)) 

H = -NzJS 2 + Hr gn ° n + 0{l) : (62) 

Hr non = JzS [ 7k( 4 5 t + AkBk) + {AkAk + Bk) " 

(63) 

Unlike the ferromagnetic case, merely expressing the Hamiltonian 
to bilinear order in the magnon variables does not diagonalize it 
immediately. We can however perform a canonical transforma- 
tion 21 to bring the Hamiltonian into diagonal form (check!): 

Oik = u k A k - v k B ] k ; a{ = u k A[ - v k B k (64) 
A = ^k^k - VkA[ ; (3 ] k = u k B ] k - v k A k , (65) 

where the coefficients Uk, Vk must be chosen such that u k — v k = 1. 
One such choice is u k = cosh#k and v k = sinh^k- For each k, 
choose the angle 6k such that the anomalous terms like A k B k 
vanish. One then finds the solution 



and 



where 



tanh 29 k = —7k, (66) 



K ag " m = -NEj + E ^k(4«k + 4A + 1), (67) 

k 



"l = E 2 J (l- 1 D, (68) 



and Ej = JzS. Whereas in the ferromagnetic case we had u k ~ 
(1 — 7k) ~ k 2 , it is noteworthy that in the antiferromagnetic case 

21 Thc definition of a canonical transformation, I remind you, is one which will give canonical commu- 
tation relations for the transformed fields. This is important because it ensures that we can interpret 
the Hamiltonian represented in terms of the new fields as a (now diagonal) fermion Hamiltonian, read 
off the energies, etc. 
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Figure 7: Integrated intensity of (100) Bragg peak vs. temperature for LaCu04, with 
T N = 195K. (After Shirane et al. 1987) 

the result uj^ ~ (1 — 7k) 1 / 2 ~ k gives a linear magnon dispersion 
at long wavelengths (a center of symmetry of the crystal must be 
assumed to draw this conclusion). Note further that for each k 

,1 ,i i i TT maqnon 

there are two degenerate modes in ii 



3.4.2 Quantum fluctuations in the ground state 

At T = 0, there are no thermally excited spin wave excitations. 
Nevertheless the spin wave theory yields an decrease of the ground 
state energy relative to the classical value —NJzS 2 , but an in- 
crease over the quantum ferromagnet result of — N\ J\S(S + 1) 
due to the zero-point (constant) term in (67). 22 The ground-state 
energy is 

Eq~-NEj + Y.U)l. (69) 

k 

22 Recall the classical Neel state, which does not contain such fluctuations, is not an eigenstate of the 
quantum Heisenbcrg Hamiltonian. 
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Figure 8: a) Inelastic neutron scattering intensity vs. energy transfer at 296K near zone 
boundary Q = (1, k, 0.5) for oriented LaCu04 crystals with Neel temperature 260K; b) 
spin- wave dispersion cu q vs. q\\, where gy is in-plane wavevector. (after Hayden et al. 
1991) 

The result is usually expressed in terms of a constant f3 defined 
by 

E = -NJzS (s + ^ , (70) 

and (3 = 0.58 for 3D and xxx for 2D. 

Quantum fluctuations have the further effect of preventing the 
staggered magnetization from achieving its full saturated value of 
5, as in the classical Neel state, as shown first by Anderson. 23 
Let us consider the average ^-component of spin in equilibrium 
at temperature T, averaging only over spins on sublattice A of 
a .D-dimensional hypercubic lattice. From (60), we have (Sf) = 
S — N~ x Ek(^k^-k) within linear spin wave theory. Inverting the 
transformation (65), we can express the As in terms of the a's 
and /3's, whose averages we can easily calculate. Note the 0th 
order in 1/5 gives the classical result, (S^), and the deviation is 

23 P.W. Anderson, Phys. Rev. 86, 694 (1952). 



28 



the spin wave reduction of the sublattice moment 



8 Ma 



= (sf) - s = -^E(4^k> 

iV k 

1 

= - E ( (u k a[ + v k (3 k ) (u k a k + v k ft k ) ) 
iv k 

1 

iv k 



(71) 



where N is the number of spins on sublattice B. We have ne- 
glected cross terms like (c^k) because the a and j3 are inde- 
pendent quanta by construction. However the diagonal averages 
(a k a k ) and (f3 k /3 k ) are the expectation values for the number op- 
erators of independent bosons with dispersion u k in equilibrium, 
and so can be replaced (within linear spin wave theory) by 



n k = ((3 k (3k) = (a k a k ) = b(u k ), 



(72) 



where b is the Bose distribution function. The tranformation 
coefficients u k = cosh 6 k and v k = sinh k are determined by the 
condition (66) such that 

1 



2 2 



= cosh 20 k 



1 

2 



1 



\ 

- 1 



such that the sulattice magnetization (71) becomes 



(73) 



(74) 



5M A 11/ 1 

= 2" iv¥r k+ 2. 



i 



(75) 



Remarks: 
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1. The correction 5 Ma is independent of S, and negative as 
it must be (see next point). However relative to the leading 
classical term S it becomes smaller and smaller as S increases, 
as expected. 

2. The integral in (75) depends on the dimensionality of the 
system. It has a T-dependent part coming from and a 
T-independent part coming from the 1/2. At T = 0, where 
there are no spin waves excited thermally, = 0, and we 



The divergence in D = 1 indicates the failure of spin-wave 
theory in one dimension, as discussed further in section xxx. 

3. The low temperature behavior of 5M(T) must be calculated 
carefully due to the singularities of the bose distribution func- 
tion when Uk — > 0. If this divergence is cut off by introducing 
a scale k$ near k = and k = (n/a, n/a), one finds that 5 Ma 
diverges as 1/ko in ID, and as log ho in 2D, whereas it is finite 
as ko — > in 3D. Thus on this basis one does not expect long 
range spin order at any nonzero temperature in two dimen- 
sions (see discussion of Mermin- Wagner theorem below), nor 
even at T = in one dimension. 



find 



5 Ma 



-0.078 D = 3 
-0.196 D = 2 
oo D = 1 



(76) 
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3.4.3 Nonlinear spin wave theory 

3.4.4 Frustrated models 

3.5 ID & 2D Heisenberg magnets 

3.5.1 Mermin- Wagner theorem 

3.5.2 ID: Bethe solution 

3.5.3 2D: Brief summary 

3.6 Itinerant magnetism 

"Itinerant magnetism" is a catch-all phrase which refers to mag- 
netic effects in metallic systems (i.e., with conduction electrons). 
Most of the above discussion assumes that the spins which interact 
with each other are localized, and there are no mobile electrons. 
However we may study systems in which the electrons which mag- 
netize, or nearly magnetize are mobile, and situations in which 
localized impurity spins interact with conduction electron spins 
in a host metal. The last problem turns out to be a very difficult 
many-body problem which stimulated Wilson to the develop of 
renormalization group ideas. 

3.7 Stoner model for magnetism in metals 

The first question is, can we have ferromagnetism in metallic sys- 
tems with only one relevant band of electrons. The answer is yes, 
although the magnetization/electron in the ferromagnetic state 
is typically reduced drastically with respect to insulating ferro- 
magnets. The simplest model which apparently describes this 
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kind of transition (there is no exact solution in D > 1) is the 
Hubbard model we have already encountered. A great deal of at- 
tention has been focussed on the Hubbard model and its variants, 
particularly because it is the simplest model known to display 
a metal-insulator (Mott-Hubbard) transition qualitatively similar 
to what is observed in the high temperature superconductors. To 
review, the Hubbard model consists of a lattice of sites labelled 
by z, on which electrons of spin j or j may sit. The kinetic energy 
term in the Hamiltonian allows for electrons to hop between sites 
with matrix element and the potential energy simply requires 
an energy cost U every time two opposing spins occupy the same 
site. 24 Longer range interactions are neglected: 

H = -t £ 4 a c ja + -U^riiani-a, (77) 

o Z cr 

<i,3> 

where < ij > means nearest neighbors only. 




t 



In its current form the kinetic energy, when Fourier transformed, 
corresponds, to a tight binding band in d dimensions of width 4dt, 

d 

6k = — 2t zZ cos k a a, (78) 

a—l 



What happened to the long-range part of the Coulomb interaction? Now that we know it is 
screened, we can hope to describe its effects by including only its matrix elements between electrons in 
wave functions localized on site i and site j, with \i — j\ smaller than a screening length. The largest 
element is normally the i = j one, so it is frequently retained by itself. Note the Hamiltonian (77) 
includes only an on-site interaction for opposite spins. Of course like spins are forbidden to occupy the 
same site anyway by the Pauli principle. 
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where a is the lattice constant. The physics of the model is as 
follows. Imagine first that there is one electron per site, i.e. the 
band is half-filled. If U = the system is clearly metallic, but if 
U — > oo, double occupation of sites will be "frozen out". Since 
there are no holes, electrons cannot move, so the model must 
correspond to an insulating state; at some critical U a metal- 
insulator transition must take place. We are more interested in the 
case away from half-filling, where the Hubbard model is thought 
for large U and small doping (deviation of density from 1 parti- 
cle/site) to have a ferromagnetic ground state. 25 In particular, we 
would like to investigate the transition from a paragmagnetic to 
a ferromagnetic state as T is lowered. 

This instability must show up in some quantity we can calcu- 
late. In a ferromagnet the susceptibility % diverges at the tran- 
sition, i.e. the magnetization produced by the application of an 
infinitesimal external field is suddenly finite. In many-body lan- 
guage, the static, uniform spin susceptibility is the retarded spin 
density - spin density correlation function, (for the discussion be- 
low I take gfiB = 1) 

X = X (q = 0, w = 0) = Hm ^ = / d 3 r / Q °° dt{[S z (r, t), 5,(0, 0)]>, 

(79) 

where in terms of electron number operators n a = ^Jf/v, the 
magnetization operators are S z = (1/2) [n-f — nj, i.e. they just 
measure the surplus of up over down spins at some point in space. 

25 At 1/2-filling, one electron per site, a great deal is known about the Hubbard model, in particular 
that the system is metallic for small U (at least for non-nested lattices, otherwise a narrow-gap spin 
density wave instability is present), but that as we increase U past a critical value U c ~ D a transition to 
an antiferromagnetic insulating state occurs (Brinkman-Rice transition). With one single hole present, 
as U — > oo, the system is however ferromagnetic (Nagaoka state). 
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Figure 9: la) Hubbard interaction; lb) Spin susceptibility vs. T for free fermions. 

Diagramatically, the Hubbard interaction Hi nt = U Ei n^n^ 
looks like figure 9a); note only electrons of opposite spins interact. 
The magnetic susceptibility is a correlation function similar to 
the charge susceptibility we have already calculated. At time 
t=0, we measure the magnetization S z of the system, allow the 
particle and hole thus created to propagate to a later time, scatter 
in all possible ways, and remeasure S z . The so-called "Stoner 
model" of ferromagnetism approximates the perturbation series 
by the RPA form we have already encountered in our discussion 
of screening, 26 which gives % = Xo/(l — U Xo)- At sufficiently high 
T (xo varies as 1/T in the nondegenerate regime, Fig. 9c)) we will 
have Uxo{T) < 1, but as T is lowered, Uxo(T) increases. If U 
is large enough, such that UNq > 1, there will be a transition at 
Uxo(T c ) = 1, where % diverges. Note for Uxo(T) > 1 (T < T c ), 
the susceptibility is negative, so the model is unphysical in this 
region. This problem arises in part because the ferromagnetic 
state has a spontaneously broken symmetry, (S z ) ^ even for 



2i. 



In the static, homogeneous case it is equivalent to the self-consistent field (SCF) method of Weiss. 
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h z —> 0. Nevertheless the approach to the transition from above 
and the location of the transition appear qualitatively reasonable. 

It is also interesting to note that for £/%(()) = UNq < 1, there 
will be no transition, but the magnetic susceptibility will be en- 
hanced at low temperatures. So-called "nearly ferromagnetic" 
metals like Pd are qualitatively described by this theory. Com- 
paring the RPA form 

X ~ 1 - U X o(T) m 
to the free gas susceptibility in Figure 9b, we see that the system 
will look a bit like a free gas with enhanced density of states, and 
reduced degeneracy temperature T F 27 For Pd, the Fermi temper- 
ature calculated just by counting electrons works out to 1200K, 
but the susceptibility is found experimentally to be ~ 10 x larger 
than No calculated from band structure, and the susceptibility is 
already Curie-like around T ~ Tp ~300K. 

3.7.1 Moment formation in itinerant systems 

We will be interested in asking what happens when we put a lo- 
calized spin in a metal, but first we should ask how does that 
local moment form in the first place. If an arbitrary impurity is 
inserted into a metallic host, it is far from clear that any kind of 
localized moment will result: a donor electron could take its spin 
and wander off to join the conduction sea, for example. Fe impu- 
rities are known to give rise to a Curie term in the susceptibility 
when embedded in Cu, for example, but not in Al, suggesting 



27 Compare to the Fermi liquid form 



m 1 + F a ' ^ 
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that a moment simply does not form in the latter case. Ander- 
son 28 showed under what circumstances an impurity level in an 
interacting host might give rise to a moment. He considered a 
model with a band of electrons 29 with energy ek, with an extra 
dispersionless impurity level Eq. Suppose there are strong local 
Coulomb interactions on the impurity site, so that we need to 
add a Hubbard-type repulsion. And finally suppose the conduc- 
tion (d) electrons can hop on and off the impurity with some 
matrix element V . The model then looks like 

1 

H = E ^ci a c ka +E £ noa+V E(cL c o+ c oCfor)+ -U E n 0a n - a , 

(82) 

where nQ a = c\ a CQ a is the number of electrons of spin a on the 
impurity site 0. By the Fermi Golden rule the decay rate due 
to scattering from the impurity of a band state e away from the 
Fermi level Ep in the absence of the interaction U is of order 

A(e) = nV 2 E S(e - e k ) ~ nV 2 N (83) 

k 

In the "Kondo" case shown in the figure, where Eq is well below 
the Fermi level, the scattering processes take place with electrons 
at the Fermi level e = 0, so the bare width of the impurity state 
is also A ~ ttV 2 Nq. So far we still do not have a magnetic mo- 
ment, since, in the absence of the interaction U, there would be 
an occupation of 2 antiparallel electrons. If one could effectively 
prohibit double occupancy, however, i.e. if U ^> A, a single 
spin would remain in the localized with a net moment. Anderson 

28 PW Anderson, Phys. Rev. 124, 41 (1961) 

29 The interesting situation for moment formation is when the bandwidth of the "primary" cond. 
electron band overlapping the Fermi level is much larger than the bare hybridization width of the 
impurity state. The two most commonly considered situations are a band of s electrons with <i-level 
impurity (transition metal scries) and d-electron band with localized /-level (rare earths/actinides- 
heavy fermions). 
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obtained the basic physics (supression of double occupancy) by 
doing a Hartree-Fock decoupling of the interaction U term. Schri- 
effer and Wolff in fact showed that in the limit U — > — oo, the 
virtual charge fluctuations on the impurity site (occasional dou- 
ble occupation) are eliminated, and the only degree of freedom 
left (In the so-called Kondo regime corresponding to Fig. 10a) is 
a localized spin interacting with the conduction electrons via an 
effective Hamiltonian 

7~i Kondo = JS • a, (84) 

where J is an antiferromagnetic exchange expressed in terms of 
the original Anderson model parameters as 

V 2 

J = 2—, (85) 

S is the impurity spin- 1/2, and 

G i = \ £ cl a (Ti) a pCvp, (86) 

with Ti the Pauli matrices, is just the conduction electron spin 
density at the impurity site. 



3.7.2 RKKY Interaction 

Kittel p. 360 et seq. 



3.7.3 Kondo model 

The Hartree-Fock approach to the moment formation problem 
was able to account for the existence of local moments at defects 
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Figure 10: Three different regimes of large U Anderson model depending on position of 
bare level Eq. In Kondo regime (Eq << Ep), large moments form at high T but are 
screened at low T. In mixed valent regime, occupancy of impurity level is fractional and 
moment formation is marginal. For Eq > Ep, level is empty and no moment forms. 

in metallic hosts, in particular for large Curie terms in the suscep- 
tibility at high temperatures. What it did not predict, however, 
was that the low-temperature behavior of the model was very 
strange, and that in fact the moments present at high tempera- 
tures disappear at low temperatures, i.e. are screened completely 
by the conduction electrons, one of which effectively binds with 
the impurity moment to create a local singlet state which acts (at 
T = 0) like a nonmagnetic scatterer. This basic physical picture 
had been guessed at earlier by J. Kondo, 30 who was trying to 
explain the existence of a resistance minimum in certain metallic 
alloys. Normally the resistance of metals is monotonically decreas- 
ing as the temperature is lowered, and adding impurities gives rise 
to a constant offset (Matthiesen's Rule) which does not change 
the monotonicity. For Fe in Cu, however, the impurity contribu- 



:5U 



J. Kondo, Prog. Theor. Phys. 32, 37 (64). 
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tion 5 pi mp increased as the temperature was lowered, and even- 
tually saturated. Since Anderson had shown that Fe possessed a 
moment in this situation, Kondo analyzed the problem perturba- 
tively in terms of a magnetic impurity coupling by exchange to a 
conduction band, i.e. adopted Eq. (84) as a model. Imagine that 
the system at t = — oo consists of a Fermi sea |0) and one ad- 
ditional electron in state |k, a) at the Fermi level. The impurity 
spin has initial spin projection M, so we write the initial state as 
\i) = 4J0; M) Now turn on an interaction TCi adiabatically and 
look at the scattering amplitude 31 
between \i) and |/) = c^^O] M f ) 

(f\T\i) = -2ni(f\H 1 +H 1 —^—H 1 + ...\i) (90) 

ek — no 

If 7i\ were just ordinary impurity (potential) scattering, we 
would have Hi = EkkW c Lx^kk /C kV> an d there would be two 
distinct second-order processes k — > k' contributing to Eq. (90), 
as shown schematically at the top of Figure 11, of type a), 

(0|c k 4,V k / p c p L_ct V^ck^lO) = ^p ^^V pk (91) 

6k — no 6k — 6 p 

31 Reminder: when using Fermi's Golden Rule (see e.g., Messiah, Quantum Mechanics p. 807): 

< 87 > 

we frequently are able to get away with replacing the full T-matrix by its perhaps more familiar lst-order 
expansion 

(Recall the T matrix is defined by ((f) f\T\(f>i) = (<t> f\T\ipf) , where the <j>'s are plane waves and ip + is 
the scattering state with outgoing boundary condition.) In this case, however, we will not find the 
interesting log T divergence until we go to 2nd order! So we take matrix elements of 

T = n 1 + n 1 — 1 —n 1 + ... (89) 

£k — no 

This is equivalent and, I hope, clearer than the transition amplitude I calculated in class. 
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and type b), 

(0|c k /c^Vp k c k } n 4'Vic'pCpCklO) 



£k — (ek — e p + e k /) 



- Vpk 7 ; r Vk'p 



= Vpt^^-H/p (92) 

€k' - 6 p 

where I have assumed k is initially occupied, and k 7 empty, with 
e k = ^k'j whereas p can be either; the equalities then follow from 
the usual application of Sc and cc^ to |0). Careful checking of 
the order of the c's in the two matrix elements will show you that 
the first process only takes place if the intermediate state p is 
unoccupied, and the second only if it is unoccupied. 

Nnow when one sums the two processes, the Fermi function 
cancels. This means there is no significant T dependence to this 
order (pi mp due to impurities is a constant at low T), and thus 
the exclusion principle does not play an important role in ordinary 
potential scattering. 



Now consider the analagous processes for spin-scattering. The 
perturbing Hamiltonian is Eq. 84. Let's examine the amplitude 
for spin-flip transitions caused by Hi, first of type a), 

(0M s ,\Hi\0M s ) (93) 

J2 i 

= — S fl S u (OM sl \c kl(jl cl af TX"C pa " —-c ] ff r^ a c ka cl a \i)M s ) 

4 e k - /To 

4 €k — € p 
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Figure 11: 2nd-order scattering processes: a) direct and b) exchange scattering. Top: 
potential scattering; bottom: spin scattering. Two terms corresponding to Eq. 90 



and then of type b) 

j 2 



I^L {M ' s \s v s,tX^Am s ). 

4 €k — e p 



(94) 



Now note that Tj a nTji a = (t^t^)^, and use the identity r^r v = 
Sup + i r a e apiv The Sfijy pieces clearly will only give contributions 
proportional to S 2 , so they aren't the important ones which will 
distinguish between a) and b) processes compared to the potential 
scattering case. The e aufl terms give results of differing sign, since 
a) gives e avfJL and b) gives e aj2U . Note the basic difference between 
the 2nd-order potential scattering and spin scattering is that the 
matrix elements in the spin scattering case, i.e. the S^, didn't 
commute! When we add a) and b) again the result is 

J 2 (11 

S(S+1)- (M' s a'\S-a\Ms,a) 



6k 



e 



p 



4 



(95) 
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so the spin-scattering amplitude does depend on T through /(e p ). 
Summing over the intermediate states pa" gives a factor 



which is of order logT for states £k at the Fermi surface! Thus 
the spin part of the interaction, to a first approximation, makes 
a contribution to the resistance of order J 3 logT (p involves the 
square of the scattering amplitude, and the cross terms between 
the 1st and 2nd-order terms in perturbation theory give this re- 
sult). Kondo pointed to this result and said, "Aha!", here is a 
contribution which gets big as T gets small. However this can't 
be the final answer. The divergence signals that perturbation the- 
ory is breaking down, so this is one of those very singular problems 
where we have to find a way to sum all the processes. We have 
discovered that the Kondo problem, despite the fact that only a 
single impurity is involved, is a complicated many-body problem. 
Why? Because the spin induces correlations between the electrons 
in the Fermi sea. Example: suppose two electrons, both with spin 
up, try to spin flip scatter from a spin-down impurity. The first 
electron can exchange its spin with the impurity and leave it spin 
up. The second electron therefore cannot spin-flip scatter by spin 
conservation. Thus the electrons of the conduction band can't be 
treated as independent objects. 

Summing an infinite subset of the processes depicted in Figure 
11 or a variety of other techniques give a picture of a singlet 
bound state where the impurity spin binds one electron from the 



/(e P ) 



p — e p 




(96) 
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conduction electron sea with binding energy 

T K = L>e- 1/(JiVo) , (97) 

where D is the conduction electron bandwidth. The renormaliza- 
tion group picture developed by Wilson in the 70s and the exact 
Bethe ansatz solution of Wiegman/Tsvelick and Andrei/Lowenstein 
in 198? give a picture of a free spin at high temperatures, which 
hybridizes more and more strongly with the conduction electron 
sea as the temperature is lowered. Below the singlet formation 
temperature of Tk, the moment is screened and the impurity acts 
like a potential scatterer with large phase shift, which approaches 
tt/2 at T = 0. 
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Reading: 

1. Ch. 26, Ashcroft & Mermin 

2. Ch. 7, Kittel 

4 Electron-phonon interaction 
4i Hamiltonian 

The subtle interplay of electrons and phonons was explained in the 
50 's by some of the earliest practitioners of quantum many-body 
theory, leading eventually to an understanding of the mechanism 
underlying superconductivity. Recall that the ions in a metal have 
two basic effects on the electronic states: 1) the static ionic lat- 
tice provides a periodic potential in which conduction electrons 
must move, leading to the evolution of plane wave states in the 
Fermi gas into Bloch waves in the crystal, and 2) the scatter- 
ing of electrons by lattice vibrations, and vice versa. The first 
effect will be ignored here, as we are essentially interested in long- 
wavelength phenomena, where the differences between proper cal- 
culations using Bloch waves and simpler ones using plane waves 
are negligible. It suffices then to consider the phonons in a lattice 
interacting with a Fermi gas in which the most important effects 
of the long-range Coulomb interaction have been accounted for. 
Without the Coulomb interaction, the phonon frequencies are 
just those we would get from a classical model of balls of mass M 
(ionic mass) connected by springs. For a 3D solid with 1 atom 
per unit cell, there are 3N normal modes comprising 3 acoustic 
phonon branches u^. When one includes the long-range Coulomb 
interaction but neglects the electron-phonon coupling, one finds 
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that the longitudinal acoustic mode has been lifted to the ionic 
plasma frequency, uj^f 1 ~ (A-KZ 2 e 2 n/M) 1 / 2 . The terms of the 
Goldstone theorem which insists on the existence of an acoustic 
mode for each spontaneously broken continuous symmetry are vi- 
olated by the long-range nature of the Coulomb force, and the 
sloshing back and forth of the ion "fluid" at uf^ 1 occurs for the 
same reason and at the same frequency (up to the mass difference) 
that it does in the electron case. At this point we are seriously 
worried that we don't understand how acoustic phonons ever ex- 
ist in charged systems. If one now includes the electron- phonon 
coupling, however, the electronic medium screens the long-range 
Coulomb interaction, leading to a finite interaction length and the 
recovery of the Goldstone (acoustic) mode. 

Let's give a brief overview of where we're going. I first want 
to get to the point where we can write down the full Hamiltonian 
for the problem. We want to show that it makes sense to write 
the Hamiltonian describing the electron-phonon system as 

H = Hel + Hph + Hcoul + H int: (1) 

where 

H° el = E&4cto (2) 

ka 

H® h = Y.Uk\(a\ x a kX + \) (3) 

k\ 2 

Hcoul = - E V{q)c\ l+qal c\ a c k+q(J c k > a > (4) 

^ kk'q 
go' 

Hint = E 9kk'c\a C k'a{a ] - q X + a qX) (5) 
kk'aX 

where a\ x creates a phonon with wave vector q = k' — k and 
polarization A, and g k y oc M~ 1 / 2 is the bare electron-phonon 
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coupling. The unperturbed phonon Hamiltonian H p h is of course 
just the sum of 3N independent harmonic oscillators in 2nd quan- 
tized form, and the bare Coulomb matrix element in Hcoul is 
V(q) = 47re 2 /q 2 . The derivation of the electron-phonon Hamilto- 
nian H int and its quantization is relatively straightforward, and I 
will sketch it here. 

4.1.1 Derivation of e~-ph coupling 

Assume the ion is located at position Rj, at a displacement 
from its equilibrium position R°. If the potential of the ion is 
assumed to be rigid, the interaction energy of the electronic charge 
density with the ions is simply 1 

H int = Y.J d 3 ri>l (r)W(r)V(r - R,). (6) 

ia 

For small amplitude vibrations, we can expand in powers of u^: 
H int = £/ tfV^(r)<Mr)V(r - R?) 

ia 

+ E/rf 3 r^(r)^(r) Ui • V R ,V(r - R,)| R o + . . . (7) 

ia 1 

Now expand the field operators t/j a in terms of Bloch waves: 

ip(r) =Ec k(7 k (r), (8) 
k 

where 

<Mr + R°) = e*' R ^ k (r) (9) 

so the quantity which appears in Eq. (7) may be recast by per- 
forming a shift by a Bravais lattice vector and using the periodicity 

1 This is for a Bravais lattice. If there is a basis one has to be a bit careful about labelling the lattice 
sites with additional intracell indices, i.e. Rj Q , a = 1 . . .m, where m is number of atoms/cell. 
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of V R oV(r - 

/ d 3 r«, CT (r)^(r)V R oV(r - R°) 

= / d 3 r<j>* Va (r + R»)^(r + R)) V R ,V(r 

= e 1 ^'^ / d 3 r<fc(r)^(r) V R oV(r ■ 



R?) 



(10) 



w 



kk' 
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Now let us 2nd-quantize the displacement u as we did when we 
were discussing the isolated phonon system, 2 

1 



with 



<?A(q) = 



1 



'2wa(«|) 

so interaction Hamiltonian can be rewritten 



a A (q) + al(-q)) 



(13) 



(14) 



,o 1 



kk'a j VNM 

xEft(q)e A (q)e iq - R ? 

qA 



X 



E E^A (Wkt/ • e A (q)) Q A (q) 

kk'a A 7 N 



TV 
M 



= E ^kk'ACkvQ^r (a A (q) + aj(-q) 

kk'aA 



(15) 



2 Before we dealt primarily with the ID chain, so I supressed the polarization indices. It is important 
to recall that in a 3 dimensional system there are 3N normal modes (3mN if there are m atoms per 
unit cell) . For each value of k there are 3 acoustic (optical only if there are add'l atoms per unit cell 
requiring index a) modes denoted by different values of the branch index A. The vectors e A (k) are the 
polarization vectors of the modes and satisfy the orthogonality condition 



(12) 
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where now q due to momentum conservation (^-function from 
summing over j) is to be interpreted as 

q = k-k' + G (16) 

with G is a vector of reciprocal lattice (arose because q was defined 
to lie in 1st B-zone). The electron-phonon coupling constant is 



9kk'\ = (W kk / -e A (q) 



(IT) 



\ 2Mu$"(q) 

The final result, then, is that an electron in state k, a can un- 
dergo a scattering process with amplitude g kk /, ending up in final 
state k', a by absorption (emission) of a phonon of momentum 
q. This form is useful, but calculating g kk / from first principles is 
difficult because V is poorly known. 



4.1.2 Jellium model 

We can get some dimensionally reasonable results in the so-called 
"jellium" model, where the ions are represented as a featureless, 
positively charged elastic continuum, 3 we will simply replace the 
eigenfrequencies uj^x of the neutral system by the constant uf^ 1 
according to the arguments given above. Again we expand the 
crystal potential V(r — Rj) around the equilibrium sites R®. The 
dispacements u(R) in the jellium gives a charge density fluctua- 
tion 4 —nZeV - u. This interacts with the electron gas through 
the Coulomb interaction, leading to the interaction 

H mum = Ze 2 Y.j d 3 rd 3 r' 4 ( r )^(r)— L-V • u(r'), (18) 



3 ... justified by the large masses and correspondingly long timcscalcs for ionic motion (Born- 
Oppcnheimer) 

4 Rccall from E& M the polarization charge density is pp = — V • P, where P is the polarization, and 
the polarization due to a density n of dipole moments p = Zeu is therefore nZeu. 
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and then quantizing the ionic displacements Ui = Ri — Rf- as in 
Eq.(13), one finds 



Hlnt mm = E c{, a c k(T (a A (q) + a\(q) 



,t 



,t 



Airiq • e^(q) 2e 2 n 
MV2 



(19) 

kkV ivi^/- g 

Comparing with Eq. (15), we see that the effective e-ph coupling 
constant in an isotropic system is 



g{q) = 



47rzZeV/ 2 



(20) 



4.1.3 Screening. 

The first point I would like to review is the renormalization of the 
electron-phonon coupling which leads to screening and the recov- 
ery of the acoustic spectrum. The main point is to realize that 
the singular behavior is due to the long-range Coulomb interac- 
tion, which will be screened. Any time a phonon of momentum 
q is excited, it creates charge density fluctuations because the 
ions are positively charged. These then interact with the electron 
gas. The bare potential created by the charge fluctuation is pro- 
portional to the electron-phonon coupling constant, so screening 
the ionic charge amounts to replacing the bare coupling g with a 
screened coupling 

g(q,u) n ) = g(q)/e(q,u n ), (21) 

where q = k' — k and e(</, u n ) = 1 — V(q)x(q, u n ) is the RPA 
dielectric constant. The frequency dependence of the dielectric 
constant will become important for us on scales of the Debye 
frequency ujd or smaller, and this is normally a very small scale by 
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electronic standards! So for most purposes we keep a frequency- 
independent screened coupling g(q) ~ g{q)/£{q, 0). 

We would like to see that this screening recovers acoustic modes 
as observed and expected. The bare interaction Hamiltonian may 
be written in the Jellium model as (see Eq. (15) 

H^t = T.g{q)n- q Q q (22) 

Q 

with n q = E q c^ +q Ck. Consider now the entire Hamiltonian for 
the phonon coordinates, including the coupling to the electron gas 
(recall derivation of normal modes for linear chain in Section 1): 

1/1 -2 

2q\M 

(23) 

The Heisenberg equation of motion for the operator Q q becomes 
(check!) 

Q q + co;fQ q + 9 _ q n q = (24) 

We noted above that the ionic charge density fluctuation induced 
by an ionic displacement u was en lon = —nZeV • u; in Fourier 
space with Eq. (13) this reads 



Hph + H int = o E ( + MculT Q q Q_ q + 2g q Q q n_ q 



n 

^ ~ M 
Also recall the definition of the dielectric constant 

"external charge" 

e = . 

total charge 

Now the total charge fluctuation is just electronic + ionic n q +7iq 

so 

n q = <"(1 - 1/6) = -iZ^ -qQ q (l - 1/6) (27) 



nT = -iZ t ^qQ q (25) 



(26) 



ion 



= iZ* 



n 



\M 



k 2 



TF 



(28) 



q 2 + k\ F ) 

where in the last step we have used the Thomas-Fermi approxi- 
mation for the dielectric function, e = 1 + k TF /q 2 . 



2 _ ion' 

^phonon pi 



1 - 



k 2 



q 2 + kj^p 



(29) 



Equation of motion becomes 



= Qq + wTOq + 



UmZe 2 n l l T 
{ qM\ 



iZ. 



n 



\M 



qQ q 



k 2 



TF 



q 2 + kj>p 



tow 
pi 



1 - 



k 2 



TF 



q 2 + k 2 



TF, 



<2q- 



(30) 



So the phonon oscillation frequencies are 



l _ ion* 

^phonon 10 pi 



q< 



q 2 + k 2 



(31) 



TF, 



Since k TF = Qirne 2 / 'Ep, we do in fact recover acoustic phonons 
iu n = cq as q — > 0, with speed 

c=(m/3M) 1/ V (32) 

So we have proven that sound propagates in a metal, which you 
already knew, but it is important to remember that screening 
makes it happen! 
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4.2 Polarons 



4.3 Bloch resistivity 

4.4 Effective e~ — e~ interaction 

For superconductivity it will be important to understand why 
one electron attracts each other effectively by polarizing the lat- 
tice. We now have at our disposal the effective screened electron- 
phonon interaction with coupling constant g^i and renormalized 
phonon frequencies LJ\(q). An interaction between electrons must 
involve the 1st electron polarizing the lattice, and the second inter- 
acting with this polarization, i.e. it must be a 2nd-order process 
in the coupling constant g. We can represent these processes in 
terms of Feynman-type pictures: The amplitude of the first pro- 




Figure V. Electron-electron interaction due to phonon exchange. Straight lines are 
electrons, wiggly are phonons, vertices are e-ph coupling constants g. 

cess, according to 2nd-order perturbation theory, where the 1st 
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electron emits a phonon, is 



V 1 

Q 



1 



\g, 



(33) 



whereas for the process where it absorbs a phonon we get 

V q 2 = _ |g " |2 _ . (34) 

Note as usual we have energy conservation at every scattering 
event depicted in these processes, i.e. 



and I've assumed u q = uj- q . According to quantum mechanics 
the two amplitudes contribute additively: 



Let's analyze this expression for the "effective electron-electron 
interaction" qualitatively. We will primarily be interested in ek — 
^k-q <C since this situation corresponds to electrons with 
their momenta both quite close to the Fermi surface, well within 
a shell of width ljd, a typical phonon energy. 

• attractive 

• ind. of k =>* isotropic 

• rapidly decreases when €k — ek- q ~ ujd- 

• energy space: interaction spread over ud =>* in time space it's 
retarded 

• comparable when spread over time to Coulomb interactions: 
some metals superconducting, some not. 



^k ~r £p ^k— q H~ ^p+q 



(35) 




(36) 
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5 Superconductivity 

5.1 Phenomenology 

Superconductivity was discovered in 1911 in the Leiden lab- 
oratory of Kamerlingh Onnes when a so-called "blue boy" 
(local high school student recruited for the tedious job of mon- 
itoring experiments) noticed that the resistivity of Hg metal 
vanished abruptly at about 4K. Although phenomenological 
models with predictive power were developed in the 30's and 
40's, the microscopic mechanism underlying superconductiv- 
ity was not discovered until 1957 by Bardeen Cooper and 
Schrieffer. Superconductors have been studied intensively for 
their fundamental interest and for the promise of technologi- 
cal applications which would be possible if a material which 
superconducts at room temperature were discovered. Until 
1986, critical temperatures (T c 's) at which resistance disap- 
pears were always less than about 23K. In 1986, Bednorz and 
Mueller published a paper, subsequently recognized with the 
1987 Nobel prize, for the discovery of a new class of materials 
which currently include members with T c 's of about 135K. 
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Figure 1: Properties of superconductors. 

Superconducting materials exhibit the following unusual be- 
haviors: 

1. Zero resistance. Below a material's T c , the DC elec- 
trical resistivity p is really zero, not just very small. This 
leads to the possibility of a related effect, 

2. Persistent currents. If a current is set up in a super- 
conductor with multiply connected topology, e.g. a torus, 
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it will flow forever without any driving voltage. (In prac- 
tice experiments have been performed in which persistent 
currents flow for several years without signs of degrading). 

3. Perfect diamagnetism. A superconductor expels a 
weak magnetic field nearly completely from its interior 
(screening currents flow to compensate the field within a 
surface layer of a few 100 or 1000 A, and the field at the 
sample surface drops to zero over this layer). 

4. Energy gap. Most thermodynamic properties of a su- 
perconductor are found to vary as e~ A ^ kBT \ indicating 
the existence of a gap, or energy interval with no allowed 
eigenenergies, in the energy spectrum. Idea: when there 
is a gap, only an exponentially small number of particles 
have enough thermal energy to be promoted to the avail- 
able unoccupied states above the gap. In addition, this gap 
is visible in electromagnetic absorption: send in a photon 
at low temperatures (strictly speaking, T = 0), and no 
absorption is possible until the photon energy reaches 2 A, 
i.e. until the energy required to break a pair is available. 

5.2 Electron-phonon interaction 

Superconductivity is due to an effective attraction between 
conduction electrons. Since two electrons experience a re- 
pulsive Coulomb force, there must be an additional attrac- 
tive force between two electrons when they are placed in a 
metallic environment. In classic superconductors, this force 



3 



is known to arise from the interaction with the ionic system. 
In previous discussion of a normal metal, the ions were re- 
placed by a homogeneous positive background which enforces 
charge neutrality in the system. In reality, this medium is 
polarizable- the number of ions per unit volume can fluctuate 
in time. In particular, if we imagine a snapshot of a single 
electron entering a region of the metal, it will create a net 
positive charge density near itself by attracting the oppositely 
charged ions. Crucial here is that a typical electron close to 
the Fermi surface moves with velocity vp = hkpjm which is 
much larger than the velocity of the ions, vj = Vpm/M. So 
by the time (r ~ 2tt/ujd ~ 10~ 13 sec) the ions have polarized 
themselves, 1st electron is long gone (it's moved a distance 

vpr rsj \{fcm/s rsj 1000 A, and 2nd electron can happen by 
to lower its energy with the concentration of positive charge 
before the ionic fluctuation relaxes away. This gives rise to an 
effective attraction between the two electrons as shown, which 
may be large enough to overcome the repulsive Coulomb in- 
teraction. Historically, this electron-phonon "pairing" mech- 
anism was suggested by Frolich in 1950, and confirmed by 
the discovery of the "isotope effect" , wherein T c was found to 
vary as M~ 1//2 for materials which were identical chemically 
but which were made with different isotopes. 

The simplest model for the total interaction between two 
electrons in momentum states k and k 7 , with q = k — k 7 , in- 
teracting both by direct Coulomb and electron-phonon forces, 
is given by 
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Figure 2: Effective attraction of two electrons due to "phonon exchange" 



47re 2 47re 2 J\ 

in the jellium model. Here first term is Coulomb interac- 
tion in the presence of a medium with dielectric constant 
e = 1 + fcg/g 2 , and cu q are the phonon frequencies. The 
screening length k~ l is 1A or so in a good metal. Second 
term is interaction due to exchange of phonons, i.e. the mech- 
anism pictured in the figure. Note it is frequency-dependent, 
reflecting the retarded nature of interaction (see figure), and 
in particular that the 2nd term is attractive for cu < cu q ~ cup. 
Something is not quite right here, however; it looks indeed as 
though the two terms are of the same order as cu — > 0; indeed 
they cancel each other there, giving V(u — > 0) = 0. Further- 
more, V is always attractive at low frequencies, suggesting 
that all metals should be superconductors, which is not the 
case. These points indicate that the jellium approximation 
is too simple. We should probably think about a more care- 
ful calculation in a real system as producing two equivalent 
terms, which vary in approximately the same way with fc^F 
and cj q , but with prefactors which are arbitrary. In some ma- 



terials, then, the second term might "win" at low frequencies, 
depending on details. 

5.3 Cooper problem 

A great deal was known about the phenomenology of super- 
conductivity in the 1950 's, and it was already suspected that 
the electron phonon interaction was responsible, but the mi- 
croscopic form of the wave function was unknown. A clue 
was provided by Leon Cooper, who showed that the noninter- 
acting Fermi sea is unstable towards the addition of a single 
pair of electrons with attractive interactions. Cooper began 
by examining the wave function of this pair ^(^l?^), which 
can always be written as a sum over plane waves 

V>(ri, r 2 ) = E Ukiqy^e-^+^X (2) 

kq 

where the Uk(q) are expansion coefficients and ( is the spin 
part of the wave function, either the singlet | f J, — ||> / \/2 
or one of the triplet, | |t>, I U>, I U + IT> /v^- In 
fact since we will demand that ij) is the ground state of the 
two-electron system, we will assume the wave function is re- 
alized with zero center of mass momentum of the two elec- 
trons, Uk(q) = UkS q fi. Here is a quick argument related to 
the electro n-phonon origin of the attractive interaction. 1 The 
electron-phonon interaction is strongest for those electrons 
with single-particle energies £k within uod of the Fermi level. 
In the scattering process depicted in Fig. 3, momentum is 

1 Thanks to Kevin McCarthy, who forced me to think about this further 
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Figure 3: Electrons scattered by phonon exchange are confined to shell of thickness Ud about Fermi 
surface. 



explicitly conserved, i.e. the total momentum 

k + p = K (3) 

is the same in the incoming and outgoing parts of the diagram. 
Now look at Figure 4, and note that if K is not ~ 0, the phase 
space for scattering (attraction) is dramatically reduced. So 
the system can always lower its energy by creating K = 
pairs. Henceforth we will make this assumption, as Cooper 
did. 

Then ^(ri,^) becomes EkUke zk '( ri ~ T2 \ Note that if is 
even in fc, the wave function has only terms oc cos k • (ri — r2), 
whereas if it is odd, only the sin k • (r\ — r?) will contribute. 
This is an important distinction, because only in the former 
case is there an amplitude for the two electrons to live "on 
top of each other" at the origin. Note further that in order to 



Figure 4: To get (attractive) scattering with finite cm momentum K, need both electron energies 
to be within o>d of Fermi level- very little phase space. 



preserve the proper overall antisymmetry of the wave function, 
Uk even (odd) in k implies the wave function must be spin 
singlet (triplet). Let us assume further that there is a general 
two-body interaction between the two electrons (the rest of 
the Fermi sea is noninteracting in the model!) V^ri,^), so 
that the Hamiltonian for the system is 



Inserting the assumed form of ip into the Schrodinger equation 
Htfj = Eip^ and Fourier transforming both sides with respect 
to the relative coordinate, r = r\ — we find 






k>kp 



(5) 



where e k = k 2 /2m and the V kk f = I d 3 rV(r)e^ kf ^ 



•r 



are the 
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matrix elements of the two-body interaction. 

Recall fc, k f correspond to energies at the Fermi level ep in 
the absence of V. The question was posed by Cooper, is it 
possible to find an eigenvalue E < 2ep, i.e. a bound state of 
the two electrons? To simplify the problem, Cooper assumed 
a model form for V^y in which 



Vhh' — 



otherwise 

where as usual = e& — ep- The BCS interaction is some- 
times referred to as a "residual" attractive interaction, i.e. 
what is left when the long-range Coulomb interaction has 
been absorbed into renormalized electron-phonon coupling 
constants, etc., as in (1). The bound state equation becomes 

u k = k (7) 

where the prime on the summation in this context means 
sum only over k such that ej < e k < ep + cu c . Now u k may 
be eliminated from the equation by summing both sides 
yielding 

1,1 

V~^2e^E (8) 

* -N de^- = -N log 2 ^ + 2u c -E (9) 
2 Uie ^ 2e-E 2 u 6 2e F -E y J 

For a weak interaction NqV < 1, we expect a solution (if at 
all) just below the Fermi level, so we treat 2ep — E as a small 
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positive quantity, e.g. negligible compared to 2uj c . We then 
arrive at the pair binding energy 

Acooper = 2e F - E ~ 2uu c e- 2 ^ v . (10) 

There are several remarks to be made about this result. 

1. Note (for your own information-Cooper didn't know this 
at the time!) that the dependence of the bound state en- 
ergy on both the interaction V and the cutoff frequency uo c 
strongly resembles the famous BCS transition temperature 
dependence, with cu c identified as the phonon frequency 
cj£>, as given in equation (1.1). 

2. the dependence on V is that of an essential singularity, 
i.e. a nonanalytic function of the parameter. Thus we 
may expect never to arrive at this result at any order in 
perturbation theory, an unexpected problem which hin- 
dered theoretical progress for a long time. 

3. The solution found has isotropic or s-symmetry, since it 
doesn't depend on the k on the Fermi surface. (How would 
an angular dependence arise? Look back over the calcula- 
tion.) 

4. Note the integrand (2ck — E)~ l = (2£& + Acooper) -1 peaks 
at the Fermi level with energy spread Ac 00 per of states 
involved in the pairing. The weak-coupling (NqV <C 1) 
solution therefore provides a bit of a posteriori justifica- 
tion for its own existence, since the fact that Ac 00 per <C oo c 
implies that the dependence of V^y on energies out near 
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the cutoff and beyond is in fact not terribly important, so 
the cutoff procedure used was ok. 

5. The spread in momentum is therefore roughly Acooper/^F, 
and the characteristic size of the pair (using Heisenberg's 
uncertainty relation) about vp/T c . This is about 100- 
1000A in metals, so since there is of order 1 electron/ unit 
cell in a metal, and if this toy calculation has anything 
to do with superconductivity, there are certainly many 
electron pairs overlapping each other in real space in a 
superconductor. 

5.4 Pair condensate & BCS Wavefctn. 

Obviously one thing is missing from Cooper's picture: if it is 
energetically favorable for two electrons in the presence of a 
noninteracting Fermi sea to pair, i.e. to form a bound state, 
why not have the other electrons pair too, and lower the en- 
ergy of the system still further? This is an instability of the 
normal state, just like magnetism or charge density wave for- 
mation, where a ground state of completely different charac- 
ter (and symmetry) than the Fermi liquid is stabilized. The 
Cooper calculation is a T=0 problem, but we expect that as 
one lowers the temperature, it will become at some critical 
temperature T c energetically favorable for all the electrons to 
pair. Although this picture is appealing, many things about 
it are unclear: does the pairing of many other electrons alter 
the attractive interaction which led to the pairing in the first 
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place? Does the bound state energy per pair change? Do all 
of the electrons in the Fermi sea participate? And most im- 
portantly, how does the critical temperature actually depend 
on the parameters and can we calculate it? 

5.5 BCS Model. 

A consistent theory of superconductivity may be constructed 
either using the full "effective interaction" or our approxima- 
tion V(q, uS) to it. However almost all interesting questions 
can be answered by the even simpler model used by BCS. 
The essential point is to have an attractive interaction for 
electrons in a shell near the Fermi surface; retardation is sec- 
ondary. Therefore BCS proposed starting from a phenomeno- 
logical Hamiltonian describing free electrons scattering via an 
effective instantaneous interaction a la Cooper: 

kk'q 
aa 1 

where the prime on the sum indicates that the energies of the 
states k and k' must lie in the shell of thickness cup. Note the 
interaction term is just the Fourier transform of a completely 
local 4- Fermi interaction ^(r)^(r)ifj(r)ifj(r) 2 

Recall that in our discussion of the instability of the nor- 
mal state, we suggested that an infinitesimal pair field could 
produce a finite amplitude for pairing. That amplitude was 
the expectation value (c\ a cL k _ a ) . We ignore for the moment 

2 Note this is not the most general form leading to superconductivity. Pairing in higher angular momentum 
channels requires a bilocal model Hamiltonian, as we shall sec later. 
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the problems with number conservation, and ask if we can 
simplify the Hamiltonian still further with a mean field ap- 
proximation, again to be justified a posteriori. We proceed 
along the lines of generalized Hartree-Fock theory, and rewrite 
the interaction as 

C ka C -k+qa fC -k'+qa' c k'a = K c L C -fc+g<7') + S^C^)} X 

x[(C-k'+qa'Ck'a) +*(cc)], (12) 

where, e.g. S(cc) = c_ k ^ qo >c k > G - (c.^+^/c^) is the fluctu- 
ation of this operator about its expectation value. If a mean 
field description is to be valid, we should be able to neglect 
terms quadratic in the fluctuations when we expand Eq (20). 
If we furthermore make the assumption that pairing will take 
place in a uniform state (zero pair center of mass momentum) , 
then we put {c_ k ^ qa >c k > a ) = (c^vc^}^. The effective 
Hamiltonian then becomes (check!) 

H~H - (Az c y_ H + h.c.) + A fc (4 T cl ki )*, (13) 

k 

where 

A = V J2(c- ki c kl ) . (14) 

k 

What BCS (actually Bogoliubov, after BCS) did was then to 
treat the order parameter A as a (complex) number, and 
calculate expectation values in the approximate Hamiltonian 

(13) , insisting that A be determined self-consistently via Eq. 

(14) at the same time. 
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5.5.1 BCS wave function, gauge invariance, and number con- 
servation. 



What BCS actually did in their original paper is to treat the 
Hamiltonian (11) variationally. Their ansatz for the ground 
state of (11) is a trial state with the pairs k ^—k I occupied 
with amplitude v k and unoccupied with amplitude u k , such 
that l^l 2 + \vk\ 2 = 1: 

if; >= U k (u k + v k c{^_ ki )\0 > . (15) 

This is a variational wave function, so the energy is to be 
minimized over the space of u kl v k . Alternatively, one can 
diagonalize the Hartree-Fock (BCS) Hamiltonian directly, to- 
gether with the self-consistency equation for the order param- 
eter; the two methods turn out to be equivalent. I will follow 
the latter procedure, but first make a few remarks on the form 
of the wave function. First, note the explicit violation of par- 
ticle number conservation: \ip > is a superposition of states 
describing 0, 2, 4 , N-particle systems. 3 In general a quan- 
tum mechanical system with fixed particle number N (like, 
e.g. a real superconductor!) manifests a global U(l) gauge 
symmetry, because H is invariant under c\ a — > e l0 c\ a . The 
state > is characterized by a set of coefficients {i^,^}, 
which becomes {uk,e 2l0 Vk} after the gauge transformation. 
The two states \ip > and ^ (</>), where (j) = 20, are inequiva- 
lent, mutually orthogonal quantum states, since they are not 

3 What happened to the odd numbers? In mesoscopic superconductors, there are actually differences in the 
properties of even and odd-number particle systems, but for bulk systems the distinction is irrelevant. 
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simply related by a multiplicative phase factor. 4 Since H 
is independent of 0, however, all states > are contin- 

uously degenerate, i.e. the ground state has a U(l) gauge 
(phase) symmetry. Any state > is said to be a bro- 

ken symmetry state, becaue it is not invariant under a C7 (1) 
transformation, i.e. the system has "chosen" a particular (j) 
out of the degenerate range < (j) < 2tv. Nevertheless the ab- 
solute value of the overall phase of the ground state is not an 
observable, but its variations £0(r, i) in space and time are. 
It is the rigidity of the phase, i.e. the energy cost of any of 
these fluctuations, which is responsible for superconductivity. 

Earlier I mentioned that it was possible to construct a num- 
ber conserving theory. It is now instructive to see how: states 
of definite number are formed [Anderson 1958] by making co- 
herent superpositions of states of definite phase 

\i/>(N) >= d^ N ' 2 \^) > . (16) 

[The integration over <j) gives zero unless there are in the ex- 
pansion of the product contained in \ip > precisely N/2 pair 
creation terms, each with factor exp i<fi] Note while this state 
has maximal uncertainty in the value of the phase, the rigidity 
of the system to phase fluctuations is retained. 5 

It is now straightforward to see why BCS theory works. 
The BCS wave function \tfj > may be expressed as a sum 
ijj >= EjvO/vl^TV) > [Convince yourself of this by calculat- 

4 In the normal state, \ip > and tp(<f>) differ by a global multiplicative phase e 10 , which has no physical consequences, 
and the ground state is nondegenerate. 

5 The phase and number are in fact canonically conjugate variables, [N/2,cp] = i, where N = 2id/d<fi in the cj> 
representation. 
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ing the explicitly!]. IF we can show that the distribution of 
coefficients is sharply peaked about its mean value < N >, 
then we will get essentially the same answers as working with 
a state of definite number N =< N >. Using the explicit 
form (23) , it is easy to show 



(N) = (4>\j:n ka \^} = 2j:\v k \ 2 ; ((N — (W)) 2 ) = T,u\v\. 

ka k k 

(17) 

Now the Uk and Vk will typically be numbers of order 1, so 
since the numbers of allowed k-states appearing in the k sums 
scale with the volume of the system, we have < N >^ V, 
and < (N— < N >) 2 >~ V. Therefore the width of the 
distribution of numbers in the BCS state is < (N— < N > 
f > 1 / 2 I < N >~ N~ 1 / 2 . As N -> 10 23 particles, this 
relative error implied by the number nonconservation in the 
BCS state becomes negligible. 



5.5.2 Is the BCS order parameter general? 

Before leaving the subject of the phase in this section, it is 
worthwhile asking again why we decided to pair states with 
opposite momenta and spin, k | and —k[. The BCS argu- 
ment had to do 1) with minimizing the energy of the entire 
system by giving the Cooper pairs zero center of mass mo- 
mentum, and 2) insisting on a spin singlet state because the 
phonon mechanism leads to electron attraction when the elec- 
trons are at the same spatial position (because it is retarded 
in time!), and a spatially symmetric wavefunction with large 
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amplitude at the origin demands an antisymmetric spin part. 
Can we relax these assumptions at all? The first require- 
ment seems fairly general, but it should be recalled that one 
can couple to the pair center of mass with an external mag- 
netic field, so that one will create spatially inhomogeneous 
(finite-q) states with current flow in the presence of a mag- 
netic field. Even in zero external field, it has been proposed 
that systems with coexisting antiferromagnetic correlations 
could have pairing with finite antiferromagnetic nesting vec- 
tor Q [Baltensberger and Strassler 196?]. The requirement 
for singlet pairing can clearly be relaxed if there is a pair- 
ing mechanism which disfavors close approach of the paired 
particles. This is the case in superfluid 3 i7e, where the hard 
core repulsion of two 3 He atoms supresses T c for s-wave, sin- 
glet pairing and enhances T c for p-wave, triplet pairing where 
the amplitude for two particles to be together at the origin is 
always zero. 

In general, pairing is possible for some pair mechanism if 
the single particle energies corresponding to the states ka and 
k'(7 f are degenerate, since in this case the pairing interaction 
is most attractive. In the BCS case, a guarantee of this degen- 
eracy for k t and — k I in zero field is provided by Kramer's 
theorem, which says these states must be degenerate because 
they are connected by time reversal symmetry. However, 
there are other symmetries: in a system with inversion sym- 
metry, parity will provide another type of degeneracy, so k f, 
k 1 5 — k T an d — k I are all degenerate and may be paired 
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with one another if allowed by the pair interaction. 



5.6 Thermodynamics 

5.6.1 Bogoliubov transformation 

We now return to (13) and discuss the solution by canonical 
transformation given by Bogoliubov. After our drastic ap- 
proximation, we are left with a quadratic Hamiltonian in the 
c's, but with and cc terms in addition to cVs. We can di- 
agonalize it easily, however, by introducing the quasiparticle 
operators 7ko and 7ki by 

c kT = <7ko + ^k7ki 
c- k | = -<7ko + ^k7ki- (18) 

You may check that this transformation is canonical (preserves 
fermion comm. rels.) if |i/k| 2 + |^k| 2 = 1- Substituting into 
(13) and using the commutation relations we get 

H BC s = £6<[(K| 2 - M 2 )(7ko7ko + 7ki7ki) + 2|v k | 2 

k 

+ 2 «7ki7ko + 2ii k ^k7ki7ko] 

+ E[(A k ^ k < + AX^k)(7ko7ko + 7ki7ki - 1) 

k 

+(A k < 2 - AX 2 ) 7k i7ko + ( A k«k " A k z4) 7 ko7ki 
+A k (c kT ci ki r, (19 



which does not seem to be enormous progress, to say the least. 
But the game is to eliminate the terms which are not of the 
form 7^7, so to be left with a sum of independent number-type 
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terms whose eigenvalues we can write down. The coefficients 
of the and 77 type terms are seen to vanish if we choose 



2£ k u k v k + Alvl - A k u 2 k = 0. (20) 



This condition and the normalization condition u k + v k \ — 
1 are both satisfied by the solutions 

(21) 



Kl 2 


1 




M 2 


2 





where I defined the Bogolibov quasiparticle energy 



E k = + |A k |2. (22) 
The BCS Hamiltonian has now been diagonalized: 
H B cs = T,E k WoTko + 7ki7ki)+E (& - ^k + A k (cj cT cl ki )* 

k v ' k v 

(23) 

Note the second term is just a constant, which will be impor- 
tant for calculating the ground state energy accurately The 
first term, however, just describes a set of free fermion exci- 
tations above the ground state, with spectrum E k . 



5.6.2 Density of states 

The BCS spectrum is easily seen to have a minimum A k for 
a given direction k on the Fermi surface; A k therefore, in 
addition to playing the role of order parameter for the su- 
perconducting transition, is also the energy gap in the 1- 
particle spectrum. To see this explicitly, we can simply do 



19 



a change of variables in all energy integrals from the normal 
metal eigenenergies £k to the quasiparticle energies E^: 

N(E)dE = N N (£)d£. (24) 

If we are interested in the standard case where the gap A is 
much smaller than the energy over which the normal state dos 
Nn(0 varies near the Fermi level, we can make the replace- 
ment 

N N (0 ~ N N (p) = N , (25) 
so using the form of E^ from (22) we find 

N(E) f ,J A , E > A 



Nt 



o 



E < A ' 



(26) 



This function is sketched in Figure 5. 
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Figure 5: a) Normalized density of states; b) Quasiparticle spectrum. 
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5.6.3 Critical temperature 



The critical temperature is defined as the temperature at 
which the order parameter Ak vanishes. We can now cal- 
culate this with the aid of the diagonalized Hamiltonian. The 
self-consistency condition is 

= V £ u k ><'(l - 7ko7ko - 7ki7ki> 

k' 

= VE^(l-2f(E k )). (27) 

Since 1 — 2f(E) = tanh[£'/(2T)], the BCS gap equation 
reads 



A* = y E 4^ tanh % 



(28) 



This equation may now be solved, first for the critical temper- 
ature itself, i.e. the temperature at which A — > 0, and then 
for the normalized order parameter A/T c for any temperature 
T. It is the ability to eliminate all normal state parameters 
from the equation in favor of T c itself which makes the BCS 
theory powerful. For in practice the parameters cj£>, A^o, and 
particularly V are known quite poorly, and the fact that two 
of them occur in an exponential makes an accurate first prin- 
ciples calculation of T c nearly impossible. You should always 
be suspicious of a theory which claims to be able to calculate 
T c ! On the other hand, T c is easy to measure, so if it is the 
only energy scale in the theory, we have a tool with enormous 
predictive power. 
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First note that at T c , the gap equation becomes 

1 =r^tanhj^. (29) 



N V JU 2T C 

This integral can be approximated carefully, but it is useful 
to get a sense of what is going on by doing a crude treatment. 
Note that since T c <C lud generally, most of the integrand 
weight occurs for £ > T, so we can estimate the tanh factor 
by 1. The integral is log divergent, which is why the cutoff 
lud is so important. We find 

^ 7 = log^ =► Te-uae-W (30) 
1 v o n) J-c 

The more accurate analysis of the integral gives the BCS result 



T c = l.Uuj D e- 1/NoV (31) 



We can do the same calculation nearT Cl expanding to lead- 
ing order in the small quantity A(T)/T, to find A(T)/T C ~ 
3.06(1 - T/T c ) 1 ' 2 . At T = we have 

1 = C d ^ = K D dEN(E)/E (32) 



N V m E k 

= Ia D dE VE 2_ A 2 - ln ( 2 ^/A), (33) 

so that A(0) ~ 2cj jD exp-l/7V l/, or A(0)/T c ~ 1.76. The 
full temperature dependence of A(T) is sketched in Figure 6). 
In the halcyon days of superconductivity theory, comparisons 
with the theory had to be compared with a careful table of 
A/T c painstakingly calculated and compiled by Miihlschlegl. 
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1 .76 T, 




T. 



♦ T 



Figure 6: BCS order parameter as fctn. of T. 



Nowadays the numerical solution requires a few seconds on 
a PC. It is frequently even easier to use a phenomenological 
approximate closed form of the gap, which is correct near 
T = and = T c : 



A(T) = 5 sc T c tanh{ 



7T 



5C ,T C . 
a^(^ - 1)}. 



(34) 



where S sc = A(0)/T c = 1.76, a = 2/3, and SC/C N = 1.43 
is the normalized specific heat jump. 6 This is another of the 
"universal" ratios which the BCS theory predicted and which 
helped confirm the theory in classic superconductors. 



5.6.4 Specific heat. 

The gap in the density of states is reflected in all thermody- 
namic quantities as an activated behavior e~ A//T , at low T, 
due to the exponentially small number of Bogoliubov quasi- 

6 Note to evaluate the last quantity, we need only use the calculated temperature dependence of A near T c , and 
insert into Eq. (47). 



23 



particles with thermal energy sufficient to be excited over the 
gap A at low temperatures T <C A. The electronic specific 
heat is particularly easy to calculate, since the entropy of the 
BCS superconductor is once again the entropy of a free gas 
of noninteracting quasiparticles, with modified spectrum E^. 
The expression (II. 6) then gives the entropy directly, and may 
be rewritten 

S = -k B J °°dEN(E){f(E)lnf(E)+[l-f(E)]ln[l-f(E)]}, 

(35) 

where f(E) is the Fermi function. The constant volume spe- 
cific heat is just C e \y = T[dS/dT]y, which after a little alge- 
bra may be written 

C dy = ^!dEN(E)(^)[E 2 - l -T^\. (36) 

A sketch of the result of a full numerical evaluation is shown 
in Figure 1. Note the discontinuity at T c and the very rapid 
falloff at low temperatures. 

It is instructive to actually calculate the entropy and specific 
heat both at low temperatures and near T c . For T <C A, 
f(E) ~ e~ E / T and the density of states factor N(E) in the 
integral cuts off the integration at the lower limit A, giving 
C~ (W A 5 / 2 /T 3 / 2 )e- A / T . 7 

7 To obtain this, try the following: 

• replace the derivative of Fermi function by exp-E/T 

• do integral by parts to remove singularity at Delta 

• expand around Delta E = Delta + delta E 

• change integration variables from E to delta E 
Somebody please check my answer! 
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Note the first term in Eq. (47) is continuous through the 
transition A — > (and reduces to the normal state specific 
heat (27T 2 /3)NqT above T c ), but the second one gives a dis- 
continuity at T c of (Cn — Cs)/Cn = 1-43, where Cs = C(T~) 
and Cn = C(T C + ). To evaluate (36), we need the T depen- 
dence of the order parameter from a general solution of (28) . 



5.7 Electrodynamics 

5.7.1 Linear response to vector potential 

The existence of an energy gap is not a sufficient condition for 
superconductivity (actually, it is not even a necessary one!). 
Insulators, for example, do not possess the phase rigidity 
which leads to perfect conductivity and perfect diamagnetism 
which are the defining characteristics of superconductivity. 
We can understand both properties by calculating the cur- 
rent response of the system to an applied magnetic or electric 
field. The kinetic energy in the presence of an applied vector 
potential A is just 

H = ^-Jll d 3 rxPl(r)[-iV - (-)A]fy a (r), (37) 
zm ° c 

and the second quantized current density operator is given by 

j(r) = — {i/j\r)(-iV - -AWr) + \(iV - -AU\r)U(r)} (38) 
2m c c 

= } P ara - — ^(r)^(r)A, (39) 
mc 
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where 



harair) = ^{^(r)V^(r) - (V^(r)Mr)}, (40) 
or in Fourier space, 

6 + 

jpara(q) = — E k4_ q(J C k(J (41) 
772 kcr 

We would like to do a calculation of the linear current re- 
sponse j(q,uj) to the application of an external field A(q,u) 
to the system a long time after the perturbation is turned 
on. Expanding the Hamiltonian to first order in A gives the 
interaction 

H' = J d 3 rj para • A = — E k • A(q)4_ q(J c q(J . (42) 

mc ka 

The expectation value < j > may now be calculated to linear 
order via the Kubo formula, yielding 

{j)(q,u) = K(q,u)A(q,u) (43) 

with 

2 

TIC 

K(q,u) = + ([3para^para])(^Lu). (44) 

mc 

Note the first term in the current 

2 

TIC 

jd*a(q, u) = A(q, u) (45) 

mc 

is purely diagmagnetic, i.e. these currents tend to screen the 
external field (note sign). The second, paramagnetic term is 
formally the Fourier transform of the current-current corre- 
lation function (correlation function used in the sense of our 
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discussion of the Kubo formula). Here are a few remarks on 
this expression: 

• Note the simple product structure of (43) in momentum 
space implies a nonlocal relationship in general between j 
and A., i.e. j(r) depends on the A(r') at many points r' 
around r. 9 

• Note also that the electric field in a gauge where the elec- 
trostatic potential is set to zero may be written E(q, uj) = 
— icjA(q, uj), so that the complex conductivity of the sys- 
tem defined by j = aE may be written 



• What happens in a normal metal? The paramagnetic sec- 
ond term cancels the diamagnetic response at uj = 0, 
leaving no real part of K (Im part of <r), i.e. the conduc- 
tivity is purely dissipative and not inductive at cj, q = 
in the normal metal. 

5.7.2 Meissner Effect. 

There is a theorem of classical physics proved by Bohr 10 which 
states that the ground state of a system of charged particles 



8 We will see that the first term gives the diamagnetic response of the system, and the second the temperature- 
dependent paramagnetic response. 

9 If we transformed back, we'd get the convolution 



10 See "The development of the quantum-mechanical electron theory of metals: 1928-33." L. Hoddeson and G. 
Baym, Rev. Mod. Phys., 59, 287 (1987). 





j(r)= / <2W(r,r')A(r') 



(46) 
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in an external magnetic field carries zero current. The essen- 
tial element in the proof of this theorem is the fact that the 
magnetic forces on the particles are always perpendicular to 
their velocities. In a quantum mechanical system, the three 
components of the velocity do not commute in the presence 
of the field, allowing for a finite current to be created in the 
ground state. Thus the existence of the Meissner effect in 
superconductors, wherein magnetic flux is expelled from the 
interior of a sample below its critical temperature, is a clear 
proof that superconductivity is a manifestation of quantum 
mechanics. 

The typical theorists' geometry for calculating the penetra- 
tion of an electromagnetic field into a superconductor is the 
half-space shown in Figure 7, and compared to schematics of 
practical experimental setups involving resonant coils and mi- 
crowave cavities in Figs. 7 a)-c). In the gedanken experiment 




a) b) c) 

Figure 7: a) Half-space geometry for penetration depth calculation; b) Resonant coil setup; c) 
Microwave cavity 
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Case, cL DC field is applied parallel to the sample surface, and 
currents and fields are therefore functions only of the coordi- 
nate perpendicular to the surface, A = A(z), etc. Since we 
are interested in an external electromagnetic wave of very long 
wavelength compared to the sample size, and zero frequency, 
we need the limit uj = 0, q — > oo of the response. We will 
assume that in this limit K(0, 0) — » const, which we will call 
— (c/47r)A~ 2 for reasons which will become clear! Equation 
(63) then has the form 

j = -^A-A, (48) 

This is sometimes called London's equation, which must be 
solved in conjunction with Maxwell's equation 

V x B = -V 2 A = — j = -A" 2 A, (49) 

c 

which immediately gives A ~ e~ z / A , and B = Boe~ Z//A . The 
currents evidently screen the fields for distances below the 
surface greater than about A. This is precisely the Meissner 
effect, which therefore follows only from our assumption that 
K(0,0) = const. A BCS calculation will now tell us how the 
"penetration depth" A depends on temperature. 

Evaluating the expressions in (44) in full generality is te- 
dious and is usually done with standard many-body methods 
beyond the scope of this course. However for q = the cal- 
culation is simple enough to do without resorting to Green's 
functions. First note that the perturbing Hamiltonian H' may 



29 



be written in terms of the quasiparticle operators (18) as 

H' = (50 

6 r + + 

E k • A(q) {{u k u k + q + ^k+q)(7k+ q o7ko - 7k+ q i7ki) 

mc k 

+ (WkW k+ q - W k Wk+q)(7k+q07kl ~ 7k+ql7ko)l 



-» -— Ek- A(0)( 7ko7k o - 7ki7ki) (51) 
mc k 

If you compare with the A = Hamiltonian (23), we see that 
the new excitations of the system are 

E k0 ~^E k -—k- A(0) 

mc 

E kl ~^E k + —k • A(0) (52) 

mc 

We may similarly insert the quasiparticle operators (18) into 
the expression for the expectation value of the paramagnetic 
current operator (41): 

6 + + 

(W«(q = 0)) = — E((7ko7ko - 7ki7ki)) 

m k 

= -E(/(£ k0 )-/(£ k i)). (53) 
m k 

We are interested in the linear response A — > 0, so that when 
we expand wrt A, the paramagnetic contribution becomes 

2e 2 ( df \ 

(i P ara(ci = 0)> = — X [k • A(0)] k (--^H . (54) 



77TC k 



as 



k^ 



Combining now with the definition of the response function 
K and the diamagnetic current (45), and recalling E k — > 
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N id^ k (dQ/A7r), with N = 3n/(2e F ) and /(d^/47r)kk = 
1/3, we get for the static homogeneous response is therefore 



£(0,0) = 



— ne 



mc 



{1- 



-df 



<9£ 



_ -n s (T)e : 



1 



(55) 
(56) 



where in the last step, I defined the superfluid density to be 
n s (T) = n — n n (T), with normal fluid density 




(57) 



Note at T = 0, — df/dE^ — » 0, [Not a delta function, as in 
the normal state case-do you see why?] , while at T = T c the 
integral n n — » l. 11 Correspondingly, the superfluid density as 
defined varies between n at T = and at T c . This is the BCS 
microscopic justification for the rather successful phenomeno- 
logical two-fluid model of superconductivity: the normal fluid 
consists of the thermally excited Bogoliubov quasiparticle gas, 
and the superfluid is the condensate of Cooper pairs. 12 

Now let's relate the BCS microscopic result for the static 
homogeneous response to the penetration depth appearing in 
the macroscopic electrodynamics calculation above. We find 
immediately 



A(T) = ( 



mc 



4irn s (T)e 2 



1/2 



(58) 



11 The dimensioness function n n (T/T c )/n is sometimes called the Yoshida function, Y(T), and is plotted in Fig. 8. 

12 The BCS theory and subsequent extensions also allow one to understand the limitations of the two-fluid picture: 
for example, when one probes the system at sufficiently high frequencies oj ~ A, the normal fluid and superfluid 
fractions are no longer distinct. 
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a) b) c) 

Figure 8: a) Yoshida function; b) superfluid density ; c) penetration depth 

At T = 0, the super cur rent screening excludes the field from 
all of the sample except a sheath of thickness A(0). At small 
but finite temperatures, an exponentially small number of 
quasiparticles will be excited out of the condensate, depeleting 
the supercurrent and allowing the field to penetrate further. 
Both n n (T) and A(T) — A(0) may therefore be expected to 
vary as e~ A / T for T <C T c , as may be confirmed by explicit 
expansion of Eq. (57). [See homework.] Close to T c , the pen- 
etration depth diverges as it must, since in the normal state 
the field penetrates the sample completely. 



5.7.3 Dynamical conductivity. 

The calculation of the full, frequency dependent conductivity 
is beyond the scope of this course. If you would like to read an 
old-fashioned derivation, I refer you to Tinkham's book. The 
main point to absorb here is that, as in a semiconductor with 
a gap, at T = there is no process by which a photon can 
be absorbed in a superconductor until its energy exceeds 2 A, 
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the binding energy of the pair. This "threshold" for optical 
absorption is one of the most direct measurements of the gaps 
of the old superconductors. 



6 Ginzburg-Landau Theory 

6.1 GL Free Energy 

While the BCS weak-coupling theory we looked at the last 
two weeks is very powerful, and provides at least a qualita- 
tively correct description of most aspects of classic supercon- 
ductors, 13 there is a complementary theory which a) is simpler 
and more physically transparent, although valid only near the 
transition; and b) provides exact results under certain circum- 
stances. This is the Ginzburg-Landau theory [ V.L. Ginzburg 
and L.D. Landau, Zh. Eksp. Teor. Fiz. 20, 1064 (1950)], 
which received remarkably little attention in the west until 
Gor'kov showed it was derivable from the BCS theory. [L.P. 
Gor'kov, Zh. Eksp. Teor Fiz. 36, 1918 (1959)]. The the- 
ory simply postulated the existence of a macrosopic quantum 
wave function ip(r) which was equivalent to an order param- 
eter, and proposed that on symmetry grounds alone, the free 
energy density of a superconductor should be expressible in 
terms of an expansion in this quantity: 

t^-k = aH 2 + m 4 + 1 |(v + ^ 2 
V 2m* c 

13 In fact one could make a case that the BCS theory is the most quantitatively accurate theory in all of condensed 
matter physics 
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where the subscripts n and s refer to the normal and super- 
conducting states, respectively. 

Let's see why GL might have been led to make such a 
"guess" . The superconducting-normal transition was empiri- 
cally known to be second order in zero field, so it was natural 
to write down a theory analogous to the Landau theory of 
a ferromagnet, which is an expansion in powers of the mag- 
netization, M. The choice of order parameter for the super- 
conductor corresponding to M for the ferromagnet was not 
obvious, but a complex scalar field ip was a natural choice 
because of the analogy with liquid i7e, where |^| 2 is known 
to represent the superfluid density n s ; 14 a quantum mechani- 
cal density should be a complex wave function squared. The 
biggest leap of GL was to specify correctly how electromag- 
netic fields (which had no analog in superfluid He) would 
couple to the system. They exploited in this case the simi- 
larity of the formalism to ordinary quantum mechanics, and 
coupled the fields in the usual way to "charges" e* associated 
with "particles" of mass m*. Recall for a real charge in a 
magnetic field, the kinetic energy is: 

1 1 f — * 

< *|?4m|* > = / d 3 r**(V + -A)H (60) 

= ^/dV|(v + ^4>| 2 , (6i) 

after an integration by parts in the second step. GL just re- 
placed e, m with e*, m* to obtain the kinetic part of Eq. (59); 



14 



tp in the He case has the microscopic interpretation as the Bosc condensate amplitude. 
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they expected that e* and rrf were the elementary electron 
charge and mass, respectively, but did not assume so. 



8f 




Figure 9: Mexican hat potential for superconductor. 



A system described by this free energy will undergo a second- 
order phase transition in zero field when a = 0: clearly when 
a is positive, the system can minimize Sf by having i/j = (no 
superconductivity), whereas if a is negative, Sf has a mini- 
mum with tfj ^ 0. The free energy (59) is a functional of 
the order parameter 0, meaning the actual value of the order 
parameter realized in equilibrium satisfies Sf /Sip = 0. 15 No- 
tice / is independent of the phase (j) of the order parameter, 
i/j = |0|e^, and so the ground state for a < is equivalent 
to any state ij) related to it by multiplication by a pure phase. 
This is the C7 (1) gauge invariance of which we spoke earlier. 

15 Thus you should not be perturbed by the fact that / apparently depends on ip even for a > 0. The value of / 
in equilibrium will be /„ = f[ip = 0]. 
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This symmetry is broken when the system chooses one of the 
ground states (phases) upon condensation (Fig 1.). 

For a uniform system in zero field, the total free energy 
F = I d 3 rf is minimized when / is, so one find for the order 
parameter at the minimum, 

1/2 



1> 



eq 
eq 



—a 
2b 

= 



a < 
a > 0. 



(62) 
(63) 



When a changes sign, a minimum with a nonzero value be- 
comes possible. For a second order transition as one lowers the 
temperature, we assume that a and b are smooth functions of 
T near T c . Since we are only interested in the region near T c , 
we take only the leading terms in the Taylor series expansions 
in this region: a(T, H) = ao(T — T c ) and b = constant. Eqs. 
(62) and (63) take the form: 



2b 

= 0. 



T<T C 
T > T r . 



(64) 
(65) 



Substituting back into Eqs. 59, we find: 



UT) - f n (T) = - a f b (T c - T)\ T<T C 
MT) - f n (T) =0, T > T c . 



(66) 
(67) 



The idea now is to calculate various observables, and de- 
termine the GL coefficients for a given system. Once they 
are determined, the theory acquires predictive power due to 
its extreme simplicity. It should be noted that GL theory is 
applied to many systems, but it is in classic superconductors 
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that it is most accurate, since the critical region, where de- 
viations from mean field theory are large, is of order 10~ 4 or 
less. Near the transition it may be taken to be exact for all 
practical purposes. This is not the case for the HTSC, where 
the size of the critical region has been claimed to be as much 
as 10-20K in some samples. 

S up ere ur rents. Let's now focus our attention on the 
term in the GL free energy which leads to supercurrents, the 
kinetic energy part: 

1 ?p* -> 

Fkm = ^ 3r 2^ l(v + T^ P (68) 

= /d 3 r^[(V|^|) 2 + (V</)-e7 C A) 2 ^| 2 ]. (69) 

These expressions deserve several remarks. First, note that 

the free energy is gauge invariant, if we make the transforma- 
— * — * 

tion A —> A + VA, where A is any scalar function of position, 
while at the same time changing ip — > ip exp(— ie*A/c) . Sec- 
ond, note that in the last step above I have split the kinetic 
part of / into a term dependent on the gradients of the order 
parameter magnitude |^| and on the gradients of the phase 
(j). Let us use a little intuition to guess what these terms 
mean. The energy of the superconducting state below T c is 
lower than that of the normal state by an amount called the 
condensation energy m From Eq. (59) in zero field this is of 
order very close to the transition. To make spatial vari- 
ations of the magnitude of ip must cost a significant fraction 
of the condensation energy in the region of space in which it 

16 We will see below from the Gorkov derivation of GL from BCS that it is of order 7V(0)A 2 . 
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occurs. 17 On the other hand, the zero-field free energy is ac- 
tually invariant with respect to changes in 0, so fluctuations 
of (j) alone actually cost no energy. 

With this in mind, let's ask what will happen if we apply 
a weak magnetic field described by A to the system. Since 
it is a small perturbation, we don't expect it to couple to \tfj 
but rather to the phase (j). The kinetic energy density shoulc 
then reduce to the second term in Eq. (69), and furthermore 
we expect that it should reduce to the intuitive two-fluid ex- 
pression for the kinetic energy due to supercurrents, ^mn s v 2 s . 

Recall from the superfluid He analogy, we expect |^| 2 = n* 
to be a kind of density of superconducting electrons, but that 
we aren't certain of the charge or mass of the "particles" . So 
let's put 

1 ?>* -> 1 1 

(70) 

Comparing with Eq. (xx), we find that the superfluid veloc- 
ity must be 

v s = — t (Vcf> + -A). (71) 
m* c 

Thus the gradient of the phase is related to the superfluid 
velocity, but the vector potential also appears to keep the 
entire formalism gauge-invariant. 

Meissner effect. The Meissner effect now follows imme- 
diately from the two-fluid identifications we have made. The 

17 We can make an analogy with a ferromagnet, where if we have a domain wall the magnetization must go to zero 
at the domain boundary, costing lots of surface energy. 
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super cur rent density will certainly be just 



e*n! , e* 



3s = -e*n* a v a = + —A). (72) 



m* c 



Taking the curl of this equation, the phase drops out, and we 
find the magnetic field: 



V x j s = -—^B. (73) 
Now recall the Maxwell equation 



m*c 



c 



js = ^VxB, (74) 



which, when combined with (14), gives 



e* 2 n 



— V x V x B = ~-V z B = ^B, (75) 

47T 47T m*c 

or 

X 2 V 2 B = B, (76) 

where 

* 2 1/2 

A = — (77) 
47re* 2 n* ' V ; 

Notice now that if we use what we know about Cooper pairs, 

this expression reduces to the BCS/London penetration depth. 

We assume e* is the charge of the pair, namely e* = 2e, and 

similarly m* = 2m, and |^| 2 = n* = n s /2 since n* is the 

density of pairs. 

Flux quantization. If we look at the flux quantization 
described in Part 1 of these notes, it is clear from our sub- 
sequent discussion of the Meissner effect, that the currents 
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which lead to flux quantization will only flow in a small part 
of the cross section, a layer of thickness A. This layer encloses 
the flux passing through the toroid. Draw a contour C in the 
interior of the toroid, as shown in Figure 10. Then v s = 
everywhere on C. It follows that 




Figure 10: Quantization of flux in a toroid. 

= f c dl- v s = f c dl- (V0 + ^4). (78) 

The last integral may be evaluated using 

f c dl • = 2tt x integer, (79) 

and 




= e j s dS-B (81) 

= (82) 
c 

Here S is a surface spanning the hole and $ the flux through 
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he 



(83) 



2e 



2e 



where n is a integer, $0 is the flux quantum, and I've rein- 
serted the correct factor of h in the first step to make the units 
right. Flux quantization indeed follows from the fact that the 
current is the result of a phase gradient. 18 

Derivation from Microscopic Theory. One of the 

reasons the GL theory did not enjoy much success at first 
was the fact that it is purely phenomenological, in the sense 
that the parameters ao, b, m* are not given within any micro- 
scopic framework. The BCS theory is such a framework, and 
gives values for these coefficients which set the scale for all 
quantities calculable from the GL free energy. The GL theory 
is more general, however, since, e.g. for strong coupling su- 
perconductors the weak coupling values of the coefficients are 
simply replaced by different ones of the same order of mag- 
nitude, without changing the form of the GL free energy. In 
consequence, the dependence of observables on temperature, 
field, etc, will obey the same universal forms. 

The GL theory was derived from BCS by Gor'kov. The 
calculation is beyond the scope of this course, but can be 
found in many texts. 



18 It is important to note, however, that a phase gradient doesn't guarantee that a current is flowing. For example, 
in the interior of the system depicted in Fig. 2, both V</> and A are nonzero in the most convenient gauge, and cancel 
each other! 
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6.2 Type I and Type II superconductivity 



Now let's look at the problem of the instability of the normal 
state to superconductivity in finite magnetic field H. A what 
magnetic field to we expect superconductivity to be destroyed, 
for a given T < T c . 19 Well, overall energy is conserved, so 
the total condensation energy of the system in zero field, f s — 
f n (T) of the system must be equal to the magnetic field energy 
ld 3 rH 2 /87r the system would have contained at the critical 
field H c in the absence of the Meissner effect. For a completely 
homogeneous system I then have 

f s (T) - f n (T) = -H 2 c /8n, (84 

and from Eq. (8) this means that, near T c , 



^f(T c -T). (85) 



Whether this thermodynamic critical field H c actually rep- 
resents the applied field at which flux penetrates the sample 
depends on geometry. We assumed in the simplified treat- 
ment above that the field at the sample surface was the same 
as the applied field. Clearly for any realistic sample placed 
in a field, the lines of field will have to follow the contour 
of the sample if it excludes the field from its interior. This 
means the value of H at different points on the surface will 
be different: the homogeneity assumption we made will not 
quite hold. If we imagine ramping up the applied field from 

19 Clcarly it will destroy superconductivity since it breaks the degenerace of between the two componenets of a 
Cooper pair. 
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zero, there will inevitably come a point H app i = ~H app i : c where 
the field at certain points on the sample surface exceeds the 
critical field, but at other points does not. For applied fields 
Happi,c < Happi < He, P ar t of the sample will then be normal, 
with local field penetration, and other parts will still exclude 
field and be superconducting. This is the intermediate state 
of a type I superconductor. The structure of this state for a 
real sample is typically a complicated "striped" pattern of su- 
perconducting and normal phases. Even for small fields, edges 
and corners of samples typically go normal because the field 
lines bunch up there; these are called "demagnetizing effects" , 
and must be accounted for in a quantitatively accurate mea- 
surement of, say, the penetration depth. It is important to 
note that these patterns are completely geometry dependent, 
and have no intrinsic length scale associated with them. 

In the 50's, there were several materials known, however, 
in which the flux in sufficiently large fields penetrated the 
sample in a manner which did not appear to be geometry de- 
pendent. For example, samples of these so-called "type II" 
superconductors with nearly zero demagnetizing factors (long 
thin plates placed parallel to the field) also showed flux pen- 
etration in the superconducting state. The type-II materials 
exhibit a second-order transition at finite field and the flux 
B through the sample varies continuously in the supercon- 
ducting state. Therefore the mixed state must have currents 
flowing, and yet the Meissner effect is not realized, so that the 
London equation somehow does not hold. 
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The answer was provided by Abrikosov in 1957 [A. A. A., 
Sov. Phys. JETP 5, 1174 (1957).] in a paper which Landau 
apparently held up for several years because he did not be- 
lieve it. Let us neglect the effects of geometry again, and go 
back to our theorist's sample with zero demagnetizing factor. 
Can we relax any of the assumptions that led to the Lon- 
don equation (72)? Only one is potentially problematic, that 
n*(r) = |^(r)| 2 = constant independent of position. Let's 
examine-as Abrikosov did-the energy cost of making spatial 
variations of the order parameter. The free energy in zero 
field is 



F = I d 3 r[a i/; 2 + 



1 



2m' 



or 



1 



—a 

where I've put 



= / d 3 r 


-v 2 


1 


1/2 = 


—2m* a 





vi 2 + e 2 |vvi 2 + — 



—a 



4 



(86) 



(87) 



1 

-2m*a {T c - T) 



1/2 



(88) 



Clearly the length £ represents some kind of stiffness of the 
quantitiy |^| 2 , the superfluid density. [Check that it does in- 
deed have dimensions of length!] If £, the so-called coherence 
length, is small, the energy cost of n s varying from place to 
place will be small. If the order parameter is somehow changed 
from its homogeneous equilibrium value at one point in space 
by an external force, £ specifies the length scale over which 
it "heals". We can then investigate the possibility that, as 
the kinetic energy of superfluid flow increases with increasing 
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field, if £ is small enough it might eventually become favorable 
to "bend" \tfj\ 2 instead. In typical type I materials, £(T = 0) 
is of order several hundreds or even thousands of Angstrom, 
but in heavy fermion superconductors, for example, coher- 
ence lengths are of order 50- 100 A. The smallest coherence 
lengths are attained in the HTSC, where ^ is of order 12- 
15 A, whereas £ c is only 2-3 A. 

The general problem of minimizing F when depends on 
position is extremely difficult. However, we are mainly in- 
terested in the phase boundary where ip is small, so life is a 
bit simpler. Let's recall our quantum mechanics analogy once 
more so as to write F in the form: 

F = j (fr[a\^\ 2 + 6H 4 ]+ < il>\Hkin\il> >, (89) 

where H^m is the operator 

1 ?p* -> 

(V + —A) 2 . (90) 



2m* o 

Now note 1) sufficiently close to the transition, we may 
always neglect the 4th-order term, which is much smaller; 
2) to minimize F, it suffices to minimize < H^ in >, since 
the | ip\ 2 term will simply fix the overall normalization. The 
variational principle of quantum mechanics states that the 
minimum value of < H > over all possible tjj is achieved 
when ij) is the ground state (for a given normalization of iji). 
So we need only solve the eigenvalue problem 

Hkm^j = Ejljjj (91) 
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for the lowest eigenvalue, Ej, and corresponding eigenfunction 
i/jj. For the given form of i^ n , this reduces to the classic 
quantum mechanics problem of a charged particle moving in 
an applied magnetic field. The applied field H is essentially 
the same as the microscopic field B since ip is so small (at the 
phase boundary only!). I'll remind you of the solution, due to 
to Landau, in order to fix notation. We choose a convenient 
gauge, 

A = -Hyx, (92) 
in which Eq. 44 becomes 

I v 2 2 

'l-i^Z + J-) - ^ - = Etfj, (93) 



2m* dx £ 2 M dy 2 dz 2 

where £m = (c/e*H) 1 / 2 is the magnetic length. Since the co- 
ordinates x and z don't appear explicitly, we make the ansatz 
of a plane wave along those directions: 

ip = r]{y)e lkxxMkzZ , (94) 

yielding 



^ [{K+ i/-df +kMy)=Eviy) - (95) 

But this you will recognize as just the equation for a one- 
dimensional harmonic oscillator centered at the point y = 
—k x £ 2 M with an additional additive constant k 2 z /2m* in the 
energy. Recall the standard harmonic oscillator equation 
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with ground state energy 



E = f = \{k/mfl\ 



(97) 



where k is the spring constant, and ground state wavefunc- 
tion corresponding to the lowest order Hermite polynomial, 

~ exp[-(rafc/4) 1/2 x 2 ]. (98) 

Let's just take over these results, identifying 

The ground state eigenfunction may then be chosen as 

7r/ 2 

= M^r 1/4 e* kMz eM-(y + kJ 2 M ) 2 /2f M % 
L y 

(100) 

where L y is the size of the sample in the y direction (L x L y L z = 
V = 1). The wave functions are normalized such that 



r 



4>k x , kz \=% (ioi) 

(since I set the volume of the system to 1). The pref actors 
are chosen such that tJjq represents the average superfluid den- 
stity. One important aspect of the particle in a field problem 
seen from the above solution is the large degeneracy of the 
ground state: the energy is independent of the variable k Xl 
for example, corresponding to many possible orbit centers. 

We have now found the wavefunction which minimizes < 
Hkin >■ Substituting back into (89), we find using (99) 

^ 7" 7" 

F = [a {T - T c ) + e — r ] J tfV|Vf + bj <fr\i)\\ (102) 
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When the coefficient of the quadratic term changes sign, we 
have a transition. The field at which this occurs is called the 
upper critical field 



/rriS 2m* can /rri . 
H c2 (T) = — °(T C -T). 



(103) 



What is the criterion which distinguishes type- 1 and type II 
materials? Start in the normal state for T < T c as shown in 
Figure 3, and reduce the field. Eventually one crosses either 
H c or H C 2 first. Whichever is crossed first determines the 
nature of the instability in finite field, i.e. whether the sample 
expels all the field, or allows flux (vortex) penetration (see 
section C). 



H 




Normal state 



Figure 11: Phase boundaries for classic type II superconductor. 

In the figure, I have displayed the situation where H C 2 is 
higher, meaning it is encountered first. The criterion for the 
dividing line between type 1 and type II is simply 



dH. 



c 



dH, 



c2 



dT 



dT 



(104) 
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or, using the results (38) and (56), 

(ra*) 2 c 2 6 1 



(105) 



7r(e*) 2 2' 

This criterion is a bit difficult to extract information from in 
its current form. Let's define the GL parameter k to be the 
ratio of the two fundamental lengths we have identified so far, 
the penetration depth and the coherence length: 

k = ^. (106) 



Recalling that 



s 

and 



, o rrfc 2 m*c 2 b , ^ 

A = — A 9— = -n 9- 107 

47re* z n* Z7re* z a 



? 2 = "^V- (108) 
2m*a 



The criterion (58) now becomes 

o m*c 2 6/27re* 2 a (m*) 2 c 2 b 1 

ft 2 = = " = 109 

l/2m*a 7re* 2 2 v ; 

Therefore a material is type I (II) if k is less than (greater 
than) \. In type-I superconductors, the coherence length is 
large compared to the penetration depth, and the system is 
stiff with respect to changes in the superfluid density This 
gives rise to the Meissner effect, where n s is nearly constant 
over the screened part of the sample. Type-II systems can't 
screen out the field close to H C 2 since their stiffness is too 
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small. The screening is incomplete, and the system must de- 
cide what pattern of spatial variation and flux penetration is 
most favorable from an energetic point of view. The result is 
the vortex lattice first described by Abrikosov. 



6.3 Vortex Lattice 

I commented above on the huge degeneracy of the wave func- 
tions (53). In particular, for fixed k z = 0, there are as many 
ground states as allowed values of the parameter k x . At H C 2 it 
didn't matter, since we could use the eigenvalue alone to de- 
termine the phase boundary. Below H C 2 the fourth order term 
becomes important, and the minimization of / is no longer an 
eigenvalue problem. Let's make the plausible assumption that 
if some spatially varying order parameter structure is going 
to form below i7 C 2, it will be periodic with period 27r/g, i.e. 
the system selects some wave vector q for energetic reasons. 
The x-dependence of the wave functions 

^ kz =o = U-rf )- 1/4 e^ex P [-( 2/ +M 2 M) 2 /24)]. (HO) 

L y 

is given through plane wave phases, e lkxX '. If we choose k x = 
qn x , with n x =integer, all such functions will be invariant 
under x — > x + 2tv jq. Not all n x s are allowed, however: the 
center of the "orbit", k x £ 2 M should be inside the sample: 

-L y /2 < k x f M = qf M n x < L y /2, (111) 
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Thus n x is restricted such that 



^ Tl x <C Ti max j C l „ (112) 



2^2 -max/- - - -max/- 

and the total number of degenerate functions is L y /(q£ 2 M ). 

Clearly we need to build up a periodic spatially varying 
structure out of wave functions of this type, with "centers" 
distributed somehow. What is the criterion which determines 
this structure? All the wave functions (110) minimize < 
Hkin >, and are normalized to f d 3 r\ifj\ 2 = |^o| 2 - They are 
all therefore degenerate at the level of the quadratic GL free 
energy, F = ! dPr\ip\ 2j \- < Hkin >■ The fourth order term 
must stabilize some linear combination of them. We therefore 
write 

4>(r) = T.C n ^ nx (r), (113) 
with the normalization e Ux \C nx \ 2 = 1, which enforces / d 3 r\ifj(r)\ 2 



q. Note this must minimize < Hkin >• Let's therefore 
choose the C Ux and q to minimize the remaining terms in F 1 
I d 3 r [a | ifj | 2 + b \ i/j | 4 ] . Substituting and using the normalization 
condition and orthogonality of the different ipu k x , we find 



.2 ; llA 



with 



and 



/ = a^o + HI (H4) 



a(H,T) = a (T-T c ) + ^§- c = £- c (H-H c2 (T)l (115) 
b(H) = sb, (116) 



iv^ ^1,^2,^x3^x4 
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J dz Jdx e ^- n ^- n X 2+n x?> +n xA )x x ^ 

, {-^[(y+^xl^M) 2 +(y+gn a; 2^M) 2 +(l/+gn a; 3^) 2 +(i/+gn x 4^) 2 ]} 

dy e m m (119) 

The form of f[ipo] is now the same as in zero field, so we 
immediately find that in equilibrium, 

_ ^.l/2 



M«, = (^y /z - (120) 



and 



I = IF (m) 

This expression depends on the variational parameters C Ux , q 
only through the quantity s appearing in b. Thus if we mini- 
mize s, we will minimize f (remember b > 0, so / < 0). The 
minimization of the complicated expression (xx) with con- 
straint J2 Ux \C nx \ 2 = 1 is difficult enough that A. Abrikosov 
made a mistake the first time he did it, and I won't inflict the 
full solution on you. To get an idea what it might look like, 
however, let's look at a very symmetric linear combination, 
one where all the C Ux s are equal: 

C n = Ti m J x . (122) 

Then 

^(r) - Ee inqx exp[-(y + nqi 2 ) 2 j2i 2 M \ (123) 
which is periodic in x with period 27r/g, 

^(x^2ir/q,y) = ^(x,y), (124) 
and periodic in q with period q£ 2 M , up to a phase factor 

1 p(x,y + qe 2 M ) = e- i ^(x,y). (125) 
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Note if q = V2tt/£ Mi 

\tfj\ 2 forms a square lattice! The area 
of a unit cell is /q) x (q£ 2 M ) = 27r^, and the flux through 
each one is therefore 



M- 



j j he 

e*# 2e 



(126) 



where I inserted a factor of h in the last step. We haven't 
performed the minimization explicitly, but this is a charac- 
teristic of the solution, that each cell contains just one flux 
quantum. The picture is crudely depicted in Fig. 12a). Note 
by symmetry that the currents must cancel on the boundaries 
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Figure 12: a) Square vortex lattice; b) triangular vortex lattice. 

of the cells. Since j s = —en s v Sl integrating V0 + jj^A = 
around each square must give, as in our previous discussion 
of flux quantization in a toroid, 

$ ce u = n$ , n = integer. (127) 

Somehow the vortex lattice consists of many such rings. The 
problem with this idea is that the only way § • d£ = 8(f) 
around the boundary can be nonzero and the usual argument 
about single-valuedness of the wave function carried through 
is if there is a "hole" in the wave function. If there is no hole, 
or region from which the wave function is excluded, the path 
can be shrunk to a point, but the value of the integral must 
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remain the same since the integrand is the gradient of a scalar 
field. This is unphysical because it would imply a finite phase 
change along an infinitesimal path (and a divergence of the 
kinetic energy!) The only way out of the paradox is to have 
the system introduce its own "hole" in itself, i.e. have the am- 
plitude of the order parameter density \ijj\ 2 go to zero at the 
center of each cell. Intuitively, the magnetic field will have an 
accompanying maximum here, since the screening tendency 
will be minimized. This reduction in order parameter ampli- 
tude, magnetic flux bundle, and winding of the phase once by 
2tt constitute a magnetic "vortex", which I'll discuss in more 
detail next time. 

Assuming C n = constant, which leads to the square lattice 
does give a relatively good (small) value for the dimensionless 
quantity s, which turns out to be 1.18. This was Abrikosov's 
claim for the absolute minimum of /[|^| 2 ]. But his paper 
contained a (now famous) numerical error, and it turns out 
that the actual minimum s = 1.16 is attained for another set 
of the C n 's, to wit 

C n = n~^/ 2 , n = even (128) 
C n = m~^/ 2 , n = odd. (129) 

This turns out to be a triangular lattice (Fig. 12b), for which 
the optimal value of q is found to be 

3VV/2 

Q = —g—, (130) 
Again the area of the unit cell is 2tt£ 2 , and there is one flux 
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quantum per unit cell. 



6.4 Properties of Single Vortex. Lower critical 
field H cl 

Given that the flux per unit cell is quantized, it is very easy to 
see that the lattice spacing d is actually of order the coherence 
length near H C 2. Using (103) and (88) we have 

Hc2 = 7*h = w r (131) 

On the other hand, as H is reduced, d must increase. To see 
this, note that the area of the triangular lattice unit cell is 
A = y/3d 2 /2 1 and that there is one quantum of flux per cell, 
A = <$>o/H. Then the lattice constant may be expressed as 

d = ^(f ) 1/2 - (132) 

Since A ^> £ is the length scale on which supercurrents and 
magnetic fields vary, we expect the size of a magnetic vor- 
tex to be about A. This means at H C 2 vortices are strongly 
overlapping, but as the field is lowered, the vortices separate, 
according to (126), and may eventually be shown to influence 
each other only weakly. To find the structure of an isolated 
vortex is a straightforward but tedious exercise in minimizing 
the GL free energy, and in fact can only be done numerically 
in full detail. But let's exploit the fact that we are inter- 
ested primarily in strongly type-II systems, and therefore go 
back to the London equation we solved to find the penetration 
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depth in the half-space geometry for weak fields, allow n s to 
vary spatially, and look for vortex-like solutions. For example, 
equation (75) may be written 

-A 2 V x V x B = B. (133) 

Let's integrate this equation over a surface perpendicular to 
the field B = B(x,y)z spanning one cell of the vortex lattice: 

-A 2 / V x (V x B) • dS = jB-dS, (134) 
-X 2 ^fl-d£ = <D . (135) 

— » — » 

But we have already argued that j s • dl should be zero on 
the boundary of a cell, so the left side is zero and there is 
a contradiction. What is wrong? The equation came from 
assuming a two-fluid hydrodynamics for the superconductor, 
with a nonzero n s everywhere. We derived it, in fact, from 
BCS theory, but only for the case where n s was constant. 
Clearly there must be another term in the equation when a 
vortex- type solution is present, one which can only contribute 
over the region where the superfluid density is strongly varying 
in space, i.e. the coherence length-sized region in the middle 
of the vortex where the order parameter goes to zero (vortex 
"core"). Let's simply add a term which enables us to get 
the right amount of flux to Eq. (133). In general we should 
probably assume something like 

\ 2 V xV x B + B = $ g(r)z (136) 

where g(r) is some function which is only nonzero in the core. 
The flux will then come out right if we demand / d?rg (r ) = 
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1. But let's simplify things even further, by using the fact 
that £ <C A: let's treat the core as having negligible size, 
which means it is just a line singularity. We therefore put 
g{f) = 8(f). Then the modified London equation with line 
singularity acting as an inhomogeneous "source term" reads 

-\ 2 V 2 B + B = $ 5 2 {r)z (137) 
pop op 

where p is the radial cylindrical coordinate. Equation (91) 
has the form of a modified Bessel's equation with solution: 

B, =^(f >. (139) 

The other components of B vanish. If you go to Abramowitz 
& Stegun you can look up the asymptotic limits: 

B z = ^2 N(^) + °- 116 L £ < P « A (140) 



5, = 



o 



A0 — e~ p /\ p>A. (141) 

Note the form (93) is actually the correct asymptotic solution 
to (91) all the way down to p = 0, but the fact that the 
solution diverges logarithmically merely reflects the fact that 
we didn't minimize the free energy properly, and allow the 
order parameter to relax as it liked within the core. So the 
domain of validity of the solution is only down to roughly the 
core size, p c± £, as stated. In Figure 5 I show schematically 
the structure of the magnetic and order parameter profiles in 
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an isolated vortex. The solution may now be inserted into the 




supercurrents 



Figure 13: Isolated vortex 



free energy and the spatial integrals performed, with some 
interesting results: 

F = L *iikz hg{K) - m 

It is easy to get an intuitive feel for what this means, since 
if we assume the field is uniform and just ask what is the 
magnetic energy, we get roughly 

I 

F v = — x vortex volume x B 2 (143) 

87T 

~ -!- x (ttX 2 L z ) x ($ /7tA 2 ) 2 (144) 

07T 

= L W (145) 

the same result up to a slowly varying factor. Now the lower 
critical field H c \ is determined by requiring the Gibbs free 
energies of the Meissner phase with no vortex be equal to the 
Gibbs free energy of the phase with a vortex. 20 G differs from 

20 We haven't talked about the Gibbs vs. Hclmholtz free energy, but recall the Gibbs is the appropriate potential 
to use when the external field H is held fixed, which is the situation we always have, with a generator supplying 
work to maintain H . 
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F through a term — / BH/Att. In the Meissner state G = F, 
so we may put 

F = F + E lme L z - ^H cl J Bd 3 r (146) 

47T 

= F + E line L z - ^-%L Z , (147) 

47T 

where £^ ne is the free energy per unit length of the vortex 
itself. Therefore 

H cl = 47T ^ ine (148) 

is the upper critical field. But the line energy is given precisely 
by Eq. (95), E Hne = I g^log(«;), so 

tf c iCT) = ^log(«). (149) 



6.5 Josephson Effect 
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6 Quantum Transport 



6.1 Disordered metals 

In this course we have thus far largely discounted the effects 
of disorder, which are of course vitally important for the un- 
derstanding of real experiments. Understanding why under 
some circumstances we are allowed to ignore disorder is as 
important as understanding why we are sometimes allowed 
to ignore electron-electron interactions. Indeed, work on the 
two subjects (disorder and interactions) has been pursued in- 
dependently for several decades, with only a few noteworthy 
attempts to solve the problems of celebrated difficulty which 
involve both. A great breakthrough on the problem of disor- 
der in 2 and 3D came in 1979, with the application of renor- 
malization group ideas to the localization problem ( "Gang of 
4", Licciardello et al 1979). 1 Ideas from this work are key to 
debates going on today in the 2D metal-insulator transition 
area and others. I am going to try to lead you through the 
physical ideas necessary to understand this breakthrough and 
its implications. 

First we review briefly the notion of a metal-insulator tran- 
sition. This we take to be a phase transition which separates 
two physical regions which differ in their electrical DC con- 
ductivities at zero temperature a(cu = 0,T = 0) = cr(0). A 
metal has finite conductivity cr(0), whlie an insulator is char- 
acterized by <r(0) = 0. You already know an example of this 

1 Disorder in ID has its own special ideas and terminology, which I will only briefly touch on here. 



1 



type of transition which does not involve disorder at all: the 
Mott transition which takes place, e.g. in the Hubbard model 
at half-filling due to interactions. In that case the parameter 
one tunes is the on-site Coulomb interaction U to go from the 
metallic (small U) to insulating (large U) state. I'm going to 
stick to discussing metal insulator transition in noninteract- 
ing disordered systems, sometimes referred to as the Anderson 
transition. By "disorder" , one means the deviation from strict 
lattice periodicity due to impurities or defects. 2 

6.1.1 Boltzmann equation for impurities 

The semiclassical theory of transport you learned last semester 
is based on a key assumption, which is often swept under the 
rug, namely that the mean free path due to whatever scat- 
tering processes one is considering, is longer than the Fermi 
wavelength. Here I'll eventually give a review of Boltzmann 
results on impurity-limited transport. See Ashcroft & Mermin 
Ch. 12 for now. 

Reminder: Drude Conductivity. At T = 0, there is no 
inelastic scattering due to phonons or e~ — e~ collisions; in fact 
in an unbounded, periodic lattice no scattering occurs at all, 
and the system has infinite conductivity; this is a consequence 
of quantum mechanics. We could equally well say that the 
collion time r for scattering, or the mean free path £ = vpr 
are infinite. The wavef unction of a particle in such a case is 

2 Thcsc defects can be so numerous that it no longer makes sense to think of small deviations from periodicity, in 
which case the word "amorphous" is sometimes used. 
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said to be ballistic. 

In a lattice weakly perturbed by defects, the situation is 
changed. The scattering from defects makes r and £ finite. 
The DC conductivity is then also finite and given by 



6.2 Localization 

In the case of Boltzmann transport which leads to the Drude 
expression (1) for the conductivity, the motion of the particles 
is assumed to be diffusive (Fig 1) The wave function ip(r) of 




Figure 1: Diffusion of a quantum particle in a disordered system. 

this particle is clearly still extended] it has a finite probability 
to be anywhere in the system, or in other words 

Hm |V(r)| 2 ^0. (2) 
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Claim: when we make the system more disordered, the wave 
function may become localized, i.e. the particle gets trapped. 
This is signalled by 



where £ is the localization length. How disordered does a 
system have to get for this to occur? To quanitfy the claim, 
we consider a specific ID model, generalize the Kronig- Penney 
model of an electron moving in a periodic 5-function lattice 
to a set of 5-function potentials of strength V{ at random 
positions with separations cl{. The potential is then given 




(3) 



1 1 



i t 



Vi 



i 1 




Figure 2: Generalized Kronig-Penney model of ID disordered solid. 



by 



so the Schrodinger eqn. becomes 

-V 2 + E ViS(x - Xi) - k 2 ij){x) = 





(5) 
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with k 2 = y/2mE. Imagine one of these peaks is located at 
the origin. The wave functions ip±(x) for x^O are then 

ijj± = A ± e lkx + B ± e~ lkx . (6) 

The boundary conditions at x = for the wave functions and 
spatial deriviatives are then 

^+(0) = V-(o) 

$h(0) = V-(0) + W(0), (7) 

Now we look for a solution which is periodic^ such that \4>+(a) | 2 
^_(0)| 2 for some arbitrary value of x = a. Another way of 
stating this is 

+ (a) = e*>_(0) (8) 

for arbitrary k and a. Plugging into the wave function (6) 
then yields 

cos ka — cos ka = — - sin fca, (9) 

k 

which gives k and a as implicit function of k. For a chain of 
such potentials, it should be clear that the wave function will 
be periodic everywhere (and the state therefore of extended 
character) only if all the a^'s and all the VJ's are equal. Oth- 
erwise (i.e. for any disorder, all wavefunctions obeying the 
usual boundary conditions must fall off exponentially at large 
distances, i.e. they represent localized states. Any statistical 
distribution of a^'s and/or VJ's lead to localization in ID. 
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6.2.1 Anderson model 



Anderson (1958) investigated a model which contains some 
features of the disorder in real 3D systems, and in the pro- 
cess proposed the first quantitative definition of localization. 
Instead of moving some of the ionic positions away from the 
regular lattice, he considered a periodic lattice and a Hamil- 
tonian H consisting of an ordinary periodic kinetic energy 
— t^i cjq + i + h.c. but random site energies V l 4 Only for the 
special case that all Vi are equal can we say with certainty that 
energy bands are formed and all states are extended (Bloch 
waves). But what happens (Anderson asked) if we now as- 
sume the Vi are statistically distributed according to 



PWi 



Vi 



w 

otherwise 



< w 

— 2 



(10) 



Anderson now proposed calculating the probability P that a 
particle, beginning at t = on site i, returns to the same 
site in the limit of long times t — > oo. If this probability is 
nonzero {P > 0), the particle is said to be localized, whereas 
if it is zero the particle is clearly in an extended state, where 
the chances of it returning is essentially zero. 5 I won't re- 
peat the calculation but give the result and try to motivate 
it a bit. The answer is expressed in terms of the site energy 
variance W which appears in the probability distribution for 

3 P.W.Andcrson, Phys. Rev. 109, 1492 (1958) 

4 It is generally believed that, since translational invariance is destroyed, models of this type are equivalent to 
more physical models as depticted e.g. in Figure 1 

5 This definition therefore clearly has the virtue of describing in a simple way why it is much more likely in low 
dimensions, particularly ID, to find localized states. 
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the site energies. It is a measure of the average on-site en- 
ergy fluctuation in the system, and, hence, the "amount of 
disorder". A second energy is the bandwidth D of the un- 
perturbed kinetic energy (for a 3D n.n. tight binding model 
it would be D = 8t-check!). Anderson's calculation yielded 

a sharp localization transition: for W/D<b extended states 

existed, while for W/D>5 they disappeared. In other words, 
there is a sharp boundary in energy between localized and ex- 
tended states. If an eigenstate lies below the threshold energy 

("mobility edge"), E<E C ~ W/5, it is localized, whereas if 
its energy E > E Cl it is extended (see figure 3). How can 



energy 

a 



D/2 



extended 




localized 













Eo/2 
W/2 



Figure 3: Random energy levels of Anderson model with disorder W and bandwidth D. 

this transition be induced in practice? Either by increasing 
the disorder level, or by tuning the characteristic energy of 
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electrons (Ep) through the mobility edge. 

What we actually measure to exhibit the transition is of 
course the conductivity The DC conductivity of a ^-dimensional 
system is 

e 2 a 2 ~ D £ £ , N 

= -r = ^ c -, (11) 

h 7T a a 

where £ = vpr, vp = hkp/m, and hp = 7r/a. Note the quan- 
tity a c is universal in the sense that it doesn't depend on 
disorder through £. The disorder dependence comes in only 
through the ratio £/a. But recall that the original deriva- 
tion of the Drude conductivity was via the Boltzmann equa- 
tion, where we had to assume £/a ^> 1 (Ioffe-Regel criterion). 
Mott therefore postulated that there would be a minimum 

possible metallic conductivity cr mm <cr c . Mott argued that 
as one approached the transition from the metallic side, the 
conductivity would decrease proportional to £ and then when 
a = (7 m i ni would drop discontinuously to zero (1st order tran- 
sition), as shown in Figure 4. Experiments in recent years 
have shown that it is at least possible to obtain conductiv- 
ities well below the minimum value suggested by Mott, and 
generally supported Anderson's picture of a continuous transi- 
tion. In Figure 4 b) are shown data where the metal-insulator 
transition was tuned in Si by addition of donor P atoms and 
the conductivity of different samples thus prepared compared 
with Mott's prediction. ?? Question- Resistivity saturation? 
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mm 




disorder 



Figure 4: a) Approach of the conductivity to the localization transition; b) Experiments of Rosen- 
baum (1980) and Palanen (1982) on Si:P. 

6.2.2 Weak Localization 

Let us now consider the localization problem a bit more care- 
fully, starting from the metallic side far from the transition 
where it is more tractable. Our goal will be to understand 
the so-called quantum corrections to the Drude conductiv- 
ity (Jo, i.e. corrections of relative order l/{kp() or higher in 
the regime where such an expansion is applicable. We start 
from what is sometimes called the Edwards, or binary alloy 
model, where a few point like scatterers of equal strength Vo 
are randomly distributed over a periodic lattice. We want to 
know the conductivity a or diffusion coefficient D for such a 
system, recalling that the two properties should be related by 
the Einstein relation 



where Nq is the density of states at the Fermi level as usual. 
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In this case let's take a dimensionless measure of disorder 
A rsj UjV^ jE\, where rii is the impurity concentration. This 
parameter is sometimes called the "coupling constant" or "dis- 
order strength" in this type of model. For D = 1 we saw that 
arbitarily weak disorder strength A — > leads to localization. 
The "critical value" of disorder A c above which one finds lo- 
calization, is zero. In D = 3 we know A c is different from 
zero, since for sufficiently weak A we know we find metallic 
behavior, i.e. nonzero a at T — > 0. What about D = 2, 
which appears to be a marginal case? This is a difficult ques- 
tion, which is still at the center of many debates (although the 
problem without interactions appears to have been resolved). 
Is there an Anderson transition in D = 2? Is A c = or > 0? 

Let's start with very weak disorder A « 1. We assume 
metallic behavior, and that effects of disorder can be treated 
perturbatively far from any Anderson transition. This will be 
true in 3D; we don't know if it works in 2D or not yet. Weak 
disorder can also be expressed in terms of the mean free path 
being much longer than the interparticle distance, i.e. kp£ ^> 
1, or equivalently Ejpt ^> 1. For concreteness, let's choose 
A = (2ttEft)~ 1 , and try to calculate the "precursor" effects 
of localization, i.e. the corrections 5a to the conductivity 

a = do + 5a, \5a\ <^ <7 . (13) 

These corrections are commonly referred to as "weak localiza- 
tion effects" . We'd like to know how 5a depends not only on 
disorder, but also other external parameters we can control, 
like system size L, frequency cj, temperature T, and magnetic 
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field H. 

Diffusion of quantum particles 

The Drude conductivity from which we begin is propor- 
tional to the inverse coupling constant as mentioned before, 

e 2 a 2 - D £ 1 

a = cc - 14 

h 7T a A 

Recall in the Boltzmann approach which led to this result 
consecutive collisions of particles are taken to be uncorrelated, 
i.e. multiple scattering is not taken into account. If there is 
a finite probability for such processes to occur, this result 
should be corrected. To see what we are missing, we exam- 
ine the diffusion of a particle in a ^-dimensional disordered 
system. Suppose that at time t = the particle is located 
at i*o. The particle diffuses away at later times, and we as- 
sume that its position can only be described statistically by a 
smooth probability distribution P(r, t) which obeys the dif- 
fusion equation: 

dP 

— - D V 2 P = 0, (15) 

where the diffusion constant is given by Do = v 2 F r/d, vp is the 
characteristic velocity of the particles (in this case the Fermi 
velocity). The explicit solution of (15) is 

|r - r | 2 



e 4 A)t 

P M) = 7 VnT? ( 16 

V 7 {AnDot ) D/2 V 7 



ii 



For long times the exponential is unimportant, so that 

| (ZV)- 1/2 D = 1 
P(r,i)oc — = poO" 1 ^ = 2 (17 
Ko!l// [ (Dot)- 372 £> = 3 

Note Vdiffis the ^-dimensional diffusion volume in which the 
particle has diffused after a long time t 6 So far everything 
we have said is purely classical. To understand what might 
change for a quantum particle, consider how different paths 
between different points contribute to the total probability 
(see figure 5). The trajectories have a typical uncertainty 
which is of course zero in the classical case, but something 
like a Fermi wavelength \ F = h/ (mvp) in the quantum case. 
Thus we can think of these paths as being really tubes of 
probability flux with finite diameter. Furthermore, we will 
assume that along each tube the quantum mechanical phase 
is preserved, i.e. we are at low enough temperatures such that 
the inelastic scattering rate is much larger than the elastic 
one, Ti n ^> t. There is for each path or "tube" a probability 
amplitude, and the total transport probability from A to B is 
just 

W= |E^| 2 = E|A,| 2 + E A, A*. (19) 

The first term represents noninterfering trajectories, i.e. the 
classical case when the tubes are infinitely narrow. The sec- 
ond term describes the quantum-mechanical interference pro- 

6 Note probability is normalized / dPrP = 1 at any time, so at long times when P no longer depends on r, we 
essentially have 

/ P = V Diff P = 1 (18) 
Jv dif , 
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cesses, neglected in the analysis of the Boltzmann equation. 
Usually this analysis is fully justified, since each path carries 
a different phase, and when one adds up all the paths the 
phases cancel, i.e. the interference is totally destructive. 




Figure 5: a) Diffusion paths between A and B; b) time-reversed return paths. 



In one special case one can see this is not true. Suppose 
points A and B are the same, i.e. we are interested in the 
return probability as discussed above. Time reversal invari- 
ance now tells us that the paths 1 and 2 shown have identical 
amplitudes A\ = = A, including in particular identical 
phases, leading to constructive interference for the two pro- 
cesses . The probability of return is therefore 2\A\ 2 in the 
classical case and 2\A\ 2 + 2A\A^ = 4|A| 2 in the quantum 
case. Therefore 

Wqm = '2W c i assica i (20) 

Thus it appears that quantum-mechanical diffusion must be 
slower than classical diffusion due to these multiple scattering 
processes alone; one can think of the effect we just described as 
an additional backscattering effect which does not exist in the 
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classical case. There is a correspondingly lower conductivity 
in the quantum case. 

One should remark that the above result obtains simply be- 
cause we allowed different trajectories between any two points 
to interfere. This is a quantum effect in the case of electrons 
because we have accounted for their wave nature. However 
in the case of wavelike phenomena of any type, including 
those thought of as classical in nature, like the propagation 
of light, the same effects will occur. It is possible to observe 
"localization" of classical light waves by looking for coher- 
ent backscattering in a disordered medium, say scattering of 
light by tiny polystyrene spheres in aqueous suspension. The 
transmission is reduced from the amount predicted by an inco- 
herent scattering calculation by just the right amount due to 
to constructive interference of light waves travelling on closed, 
time-reversed paths just as in the case of weak localization of 
electrons. 

Conductivity correction. We now need to be a bit 
more quantitative and try to estimate the size of the correc- 
tions to (Jo due to these effects. Due to the above discussion 
we expect the sign of the correction 5a to be negative. To do 
this we need to estimate the probability of the closed configu- 
rations among all configurations contributing to <j. First note 
that the interference contribution of two tubes is proportional 
to their intersection volume. But the return path tubes are 
unique in that their entire volume is necessarily an intersection 
with a time-reversed path. What we'll do is simply calculate 
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the ratio of the volume of a small element of the return tube 
to the entire diffusion volume the particle could occupy at a 
time t (which includes all paths) . This is the probability that 
a particle will return & intersect itself during the diffusion. 
As shown in Figure 6, the volume of the tube dV swept out 



cross- 
section 

,D-1 




d^Vpdt 




a) 



b) 



Figure 6: a) small volume element of return path tube; b) comparison of total diffusion volume 
with that of return path tube. 

by the particle in a time dt is the cross section A^ -1 times the 
length swept out vpdt, so the probability is 

dV . n i ^. dt , A 

(21 



R 



closed 



r. 



= Vp\ 



D- 
F 



1 J T in 



V diff ~" r h (D t) D ^ 

where the lower limit r is essentially set by the time the par- 
ticle starts diffusing, and the upper one by the time where 
phase-breaking scattering destroys the phase coherence of the 
individual trajectories and hence the possibility of interfer- 
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ence. Performing the integrals, we obtain 



) V2 D = 1 
oc -A x log (^) L> = 2 

)" 1/2 £ = 3 



(22) 



To get a feeling for what these mean and how big they might 
be, assume that the inelastic relaxation rate is some power 
of the temperature r-" 1 ~ T p (for electron-phonon collisions 
it would be p = 3 ( T <C Tp), for electron-electron p = 2). 



Remarks: 

• Corrections all decrease conductivity, consistent with ex- 
pectation that quantum effects amount to additional source 
of backscattering relative to Drude (classical) result. 

• In D = 3 these corrections are negligible at low T. 

• In this approximation the correction is linear in the disor- 
der parameter A 

• For D = 1, the correction is actually classical (since tubes 
have zero diameter); also the result is divergent, consisent 
with full localization in ID. 

• In 2D, correction is logarithmic but also leads to diver- 
gent result. We are therefore not allowed to use the weak 
localization result in 2D below a temperature set roughly 



Then 




T -p/2 D = 1 
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by (Jo — 5a. In particlular, we are not allowed to draw 
conclusions about whether all states are localized or not 
at T = 0. 

• We needed to assume time reversal invariance to make 
the argument above, which is not correct in the presence 
of magnetic field (see below). 

6.2.3 Scaling theory of localization 

An answer to the question of what happens in 2D, as well as 
many other questions, was provided by the scaling theory of 
localization 7 The approaches to the problems discussed above 
are mostly suited either to the metallic or the insulating side 
of the metal-insulator transition, but are not capable of dis- 
cussing what happens at the transition itself. Abrahams et al. 
(the so-called "gang of four") drew upon ideas about phase 
transitions which had been developed in the late 60's and 70 's 
by Fisher, Kadanoff and others, which proposed that because 
phase transitions involved fluctuations on all length scales, 
properties of the system would become scale-invariant. Scal- 
ing theories asked how properties of the system changed as the 
size of the system was increased, and made scaling hypothe- 
ses regarding the functional independence of these properties 
of microscopic system parameters near the transition. 

Abrahams et al. chose as an appropriate property the con- 
ductance g = \jR of a hypercube of side L (R is the re- 
sistance), whose importance for the localization problem had 

7 Abrahams, Anderson, Licciardello & Ramakrishnan (1979). 
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been pointed out by Thouless. We know two limits of how 
g depends on system size. In the metallic (ohmic) regime, 
g rsj Z/ 0-2 , 8 whereas in the localized regime because the wave- 
functions have a characteristic size £ the conductance must 
scale exponentially g ~ exp(—L/^) for L ^> £. How can 
we interpolate between these two known cases . Assume we 
have a hypercube of side L and conductance g(L), and ask 
how the conductance g(bL) of a larger hypercube of side bL 
(6 > 1 is a scale factor) depends on the conductance of the 
starting cube. In principle, g(bL) could depend not only on 
the value of the conductance g{L) at the smaller scale, and 
the length L itself, but on lots of other microscopic proper- 
ties, g(bL) = f\g(L), L, r, hp, . . .]. But let's assume that near 
the transition the behavior is universal, i.e. the only relevant 
parameter is the g of the smaller system, i.e. 

g(bL) = f(b,g(L)), (24) 

or equivalently 

g(L) = f[b,g(L/b)}, (25) 

and / is some function. Eq. 25 is what we mean by the scaling 
hypothesis of Abrahams et al., from which the entire theory 
follows. Note that the relation should hold for any 6, so that 
there is nothing to prevent us from differentiating both sides 
with respect to 6, and then setting 6=1. Then we obtain 



8 Rccall from freshman physics that for a resistor R = pL/A, where L is the length and A is the cross-sectional 
area. For a cube R = 1/g ~ L -1 , so g ~ L. 
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Figure 7: Schematic localized wavefuntion in two different size systems. 



where f\ is a new function of g involving /. Thus the loga- 
rithmic derivative of g depends only on g itself, and not on 
any other length scales! Abrahams et al. defined a /3 function 
(the name comes from renormalization theory): 



1 dg(L) d\ogg(L ; n 
g a log L a log L 

where fi is a new function which is still only dependend on 
g(L). The f3 function has known limits based on what we 
have already figured out about g: 

log g localized 

f3 = \ (28) 
D _ 2 _c(p) n \ metalll 

l 9 W) ' 

where c(D) is a dimension-dependent factor which controls 
the weak-localization corrections discussed in the previous sec- 
tion. Already at this stage it is reasonable to guess a smooth 
interpolation between these limits. The guess of Abrahams et 
al. is sketched in Figure 8. 

What do these curves mean? Note if /3 > 0, it means that 
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fixed 




1 

Figure 8: /3-function prediction of scaling theory of localization. Solid lines correspond to analytic 
estimates made in text. 

dg/dlogL > since g > always. This means that in- 
creasing the system size increases the conductance. Look for 
example at the D = 3 case above the point where the curve 
crosses the (3 = axis. Any logg corresponds to a particu- 
lar L since the curve is monotonic; increasing the system size 
from here increases the conductivity, hence moving you to 
the right along the curve. This is the meaning of the arrows 
pointing to the right on this curve above (3 = 0. The system 
"flows" towards metallic behavior as you go towards the ther- 
modynamic limit. Every other point on all the curves shown 
corresponds to (3 < 0, so the arrows point the other way: 
as you increase the system size, the conductivity decreases, 
i.e the system "flows" towards localized behavior. Only the 
D = 3 curve has a "fixed point (unstable)" with (3 = where 
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the flow moves away in both directions. This point indicates 
the existence of an Anderson transition in 3D. 9 

What about 2D? Can we induce a transition between metal- 
lic and localized behavior in very thin films? The forego- 
ing considerations suggest the answer is no. Note the (3 — 
function for 2D is always negative, i.e. the system always 
becomes localized as you increase the size. It is interesting to 
connect this discussion to the preceding one on weak localiza- 
tion. We knew from classical considerations in the strongly 
metallic regime only that (3 —> as log L —> oo. The sign of 
the weak localization correction to the /^-function, — c(2)/g, 
with c(2) > 0, indicates that even if, at a given system size L, 
the conductance were huge, the "flow" would be towards local- 
ized behavior, i.e. for macroscopic film sizes there should be 
no extended states. 10 Thus the weak localization calculation 
is extremely important even though we said it only strictly 
applies in the "perturbative regime". 11 

Returning to 3D for a moment, we note that information 
about the transition itself can be extracted. Near the fixed 

9 The terms "flow" and "unstable fixed point" come from the theory of Wilson's renormalization group (RG), 
where the meaning is only slightly different. Usually when studying RG flows, we are looking at the transformation 
of a set of parameters describing how the system's Hamiltonian changes as the scale is changed, or, what is frequently 
equivalent, energy or temperature is reduced. See e.g. N. Goldcnfcld, Lectures on the Renormalization Group 

10 At this writing the existence of metallic behavior in 2D is still controversial. In 199x, experiments by Kravtsov?, 
Sarachik, and Pudalov? suggested that a transition to metallic behavior was possible even for very thin films. Some 
theorists believe this is a genuine metal-insulator transition in 2D which violates the predictions of the scaling theory 
because the effects of electron-electron interactions, not accounted for in the scaling theory, are important enough to 
slightly shift the marginal behavior of the /3-function so that a fixed point is created. Others believe the experiments 
can be explained by traps or other "T-dependent disorder, a picture which implies that the apparent metallic behavior 
must disappear at sufficiently low temperatures. 

n We said before the weak localization calculation started in the "metallic regime". In 2D this means in the regime 
where g is large, even though the system is never truly metallic. 
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point, the /3-function can be linearized as 




v V 9c I ' 



(29) 



where \ jv is defined to be the slope at the transition, d(3/d log g 
We can integrate this to find 



where go = g(Lo). In the localized regime, we substitute the 
assumed form g 

6.2.4 Universal conductance fluctuations 

6.2.5 Effect of Magnetic Field 

6.2.6 hc/e and hc/2e oscillations 

6.3 Quantum Hall effect 

6.4 Mesoscopic transport 

6.4.1 Coulomb blockade 



L ( 9 ~ 9 c) 



(30) 



Lo [go — 9c) 
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1 2nd quantization and stat mech (continued) 



1.1 Appendix. Free Fermi Gas 

The hamiltonian is 



where 

The mean value will be 
where 

At T -> and in 3D 



H = ? kcLck CT 

kcr 

e k = e k - [i. 
(H ) = ^e k ( c L c k<x> = ^e k n k(T 

kcr kcr 

1 

n 



kCT e*/ T + 1 



kcr 

Derive (c k c k ) = n k (neglect spin). 

1.2 Appendix. Free Bose Gas 

"Same" Hamiltonian 

Ho = e k a k a k 

k 

where a k has bosonic commutation relations. Here we have 

K = ( a W) 
The density of states for a 3D gas is 



e /3e~k _ 1 • 



k 

so by applying the conversion 



3/2 

1/2 

4tt 2 V ft 2 J 



k 

we have 



/•oo 

(n) = c / de- 
Jo < 



./J(e-fi) _ 1 _ n 




Figure 1: Chemical potentials of free homogeneous Bose, classical, and Fermi gases fii/T vs. 
temperature T. 



and we can solve for [i. Note due to the negative sign in the denominator [i must always 
be negative otherwise the occupation number < for some k. Note further there is a 
problem as T — > 0, with e > /i, instead of being equal to zero we have (n) — > rather than 
a fixed number. That means there is no solution for Where is the error? We forgot the 
possibility of macroscopic occupation of the ground state, i.e. 

poo £ l/2 

X>^° + c / o+ r ( n ) 

We can extract the critical temperature that this can happen by taking the limit Uq — > 
which will determine the onset of the Bose-Einstein condensation (BEC) temperature while 
\i = 0. In other words 

roo 1/2 

n = cT 3/2 / rfx^- (12) 
Jo e — i 



C(3/2)r(3/2) 

which means that for the 3D parabolic band case we have T ~ n 2 / 3 . 



2 Confinement of atoms and BEC 

2.1 History of BEC 

1908 Discovery of superfluid 4 He ("He-II"). 

1938 F. London suggests He-II = BEC but in liquid state that means there are strong 
interactions present and n (T = 0) 1. 
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1959 Hecht: spin-polarized H might remain gaseous to low T: 2 H| atoms can not form H 2 . 
1980 Silvera and Walraven stabilze Hf. 

80's Search for methods to confine Hj — > magnetic traps (Kleppner Graytak). 
1989 Dehmelt and Paul Nobel prize for ion traps. 

1995 BEC in laser traps: C. Wiemann, E. A. Cornell(Rb), W. Ketterle(Na). 
2001 Nobel prize to Wiemann, Cornell, Ketterle 



2.2 Atom confinement 

In modern laser or magnetic traps atoms are confined in an approximately harmonic poten- 
tial. Let us compare the DOS for harmonic trap and free gas in a box with hard walls. 



Free gas (3D) 


Harmonic trap 


i 


h = e{£ + + ^ + 

i 

£ ni ,n 2 ,n a = tvu\(n x + \) + huj 2 {n 2 + §) + huj 3 (n 3 + \) 


N (z) = 7& 1/2 





Aside from the power law N(e) ~ e a in DOS, there should be little qualitative 



difference in 2 noninteracting systems, so let us redo the BEC for a general N(e) = C a e 



a-l 



N = N ex (T c ,fi 



POO 

0) = / deN(t 
Jo 



3 e/T c 



dx 



x 



a-l 



(13) 



so we have 



N 1/a . 



(14) 



In other words for the free gas T c ~ N 2 ^ 3 while for the harmonic trap T c ~ N 1 ^ 3 . 
For a 2D free gas N(e) ~const. and 



Jo e*-l 



oo 



(15) 



so there is no BEC in a 2D free gas except at T—0. But in a 2D harmonic trap N(e) ~ e so 
a = 2 and T c ~ N 1 ^ 2 . This does not violate theorems we will encounter later about lack of 
long range order in 2D at finite T because the system is not infinite in extent. 
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n 




1-(T/T c f 



T 



T 



c 



Figure 2: Condensate fraction vs. T for gas with N(e) 



a-l 



density of states. 




Figure 3: Two different methods of detecting condensate in atomic clouds with laser: a) 
direct measurement of absorption profile, suffers from heating effects; b) phase contrast 
imaging. 

2.3 Condensate fraction 

For T < T c so that /i = the number of excited particles will be 




(16) 



which converges for a > 1 into the value 



N ex = C a T(a)C(a)T 



(17) 



so we get 




(18) 



2.4 Density profile and velocity distribution 

Or, how to tell if you have a BEC! 

The density profile of the trapped cloud is measured with a laser. 
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2.4.1 Ground state profiles 

Lowest SHO eigenfunction is 



-.2 



Ml) = ^a^ e *' e "* e (19) 



where 



muji 

The density profile is n (r) = |0 o (r)| 2 or in momentum space 



— ~ lCT 1 //™- (20) 



n (p) =iV|0 o (p)| 2 = ^3/4 e ?e ^ e ^ ( 21 ) 
where 

Q = -. (22) 

This is a Maxwellian distribution with effective "temperatures" Tj = a>j/2. The main point 
is that the velocity and density distributions are in general anisotropic (x,y,z are different). 

2.4.2 Semiclassical (T > T c ) profiles 

In this case we have 



1 

n[r) = 

where 



2 2 2 

n(r) = ^ I d 3 pe-? H ~ e- y ^' T = e ~% e _ %e~% (23) 



IT 

R\ = — 2 . (24) 
but the momentum distribution will be 

n(p) = i J d 3 re-? H ~ e^ 2 " 17 (25) 

that means isotropic. 

Remarks. The size of the cloud of the thermal excitations is ^> the size of the 
condensate. Quatum mechanically we have SHO levels, where the size of the nth level is 
( r )n ~ y/no-o with a = a/1 / mu for the isotropic case. The thermal occupation of the levels 
gives nuj ~ T so we have 

ff 

(r) n ~ \ — a ~ Ri (from semiclassics) (26) 

V LO 

so the semiclassic approximation is justified for excitations as long as T/u ^> 1, which is well 
satisfied for most traps. Typical values are T ~ /iK and uj ~10 3 Hz ~ nK. To detect BEC 
we create a cloud, and allow it to expand. The condensate will be highly anisotropic, but 
since n{p) for the thermal excitations is isotropic, the cloud of excitations becomes isotropic. 
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a) Hot beam b) Cooling stage ("Zeeman slower") c) Trap 

Figure 4: Basic configuration for creating, cooling and trapping alkali gases (schematic) note 
in particular I have left out possible evaporative cooling stage, and depicted only a simple 
quadrupole trap. 



2.5 Trapping and cooling of atoms 

Evaporate atoms from the surface of alkali metal (~ 600K). Usually the two-body recom- 
bination times are slow (of the order of the age of the universe!) because it is hard to 
satisfy conservation laws. The three-body recombination times are of the order of minutes, 
so experiments must be done during this time. The system is in principle not stable but 
metastable. 

For the cooling the atoms absorb light and slow down. For the absorption we need 
oj = E n+ \ — E n . The velocity dispersion in the cloud causes a Doppler broadening Sen where 

5e D = -k • v (27) 

while v is the atom velocity and k the laser wave vector. For the absorption in the lab frame 
we have 

7/2 

7p = l + [2(cu-cuo + 5e D )/7] 2 (28) 

where u — uj is referred as "detuning" and c^o is the rest frame resonant frequency. For 
maximum deceleration we have ui — uj + 5e£> < 7 and makes the laser resonant with atoms 
in their rest frame and we will have 

F = hk lp F rnax -> (29) 

As atoms in the beam slow down, the changing 5eo takes them out of resonance so they 
can not cool further. To compensate for that we can introduce an inhomogeneous magnetic 
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bare atomic line 




Figure 5: Spectrum of atomic transition with and without Doppler broadening. 
Figure 6: a) Quadrupole trap; b) Ioffe-Pritchard trap. 



field — "Zeeman cooler". As the atom slows down, it progresses further along the path and 
the Doppler Sen decreases but Sezeeman = • B. We choose B and B' to keep the cloud 
together on resonance. The result is a cooling down to 30 m/s. 

2.5.1 Magnetic traps 

The design is a combination of wires and coils to trap some atoms though the Zeeman 
interaction. So spin-up atoms are attracted to regions of low field while spin-down are 
repelled. No static field configuration with maximum, can find minimum. The simplest 
quadrupole trap is 

B = B'(x, y, -2z) -> B = B'(x 2 + y 2 + Az 2 ) 1 ' 2 . (30) 

We have assumed that /i aligns itself with the local field and follows the field lines so 
Sezeeman = Hl-^li A 4 < 0. Then the spin-up state atoms seek the lowest field magnitude. 

Problem : t-dependent B field causes spin-up go to spin-down transitions. When 
u e ff ~ 6 e zeeman arises since particles are moving in inhomogeneous field. Trap losses in 
vicinity of minimum with B = 0. 

Solution: "Plug hole": 

• TOP trap (time-averaged orbiting potential) use quadrupole but apply rotating homo- 
geneous field to move position of node in t dependent way. If u <^ fi(B) the rotating 
field itself won't cause transitions. 
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• Ioffe-Pritchard trap: First try two Helmholtz coils, same sense then 

B = consti + 3const 2 (^ 2 - -p 2 ) (31) 
The result is that it doesn't trap in both directions z, p. Add four wires then 



1 c 2 

B = ci + 3c 2 (z 3 - -p 2 ) + -. (32) 

2 p z 

Can now choose c such that trap is confining, i.e. that coeff. of p 2 is positive, and it 
will have desired minimum at \B\ > 0. 

2.5.2 Optical traps: 

Atom with energies E , E\ in presence of a laser field 

H D ipoie = -p • £{t) (33) 
The idea is that the electric field £ polarizes the atom 

aM = Kij£^ (34) 

where 7 is the natural lifetime of the excited state in the absence of the laser field, and u> is 
the laser frequency. Shift AE by 

^ = 4 a,(w)m ' = ^74 (35) 

and the transition rate of atoms leaving the ground state is 

l 9 = \a'\u){£ 2 ) (36) 

while the "Rabi" frequency is Qr = (l|p • £ u \0) and the detuning 5 = uj — oj while c^o = 
Ei — Eq. So AEo is less than zero if 5 < 0- "red detuning". Spatial variation of E leads to 
trapping, i.e. — V-Eo gives a restoring force near center of laser beam. 

A focussed laser field produces an electric field profile with maximum at one point; 
for red detuning atoms are attracted to the center. Detuning is also necessary to prevent 
absorption pressure and heating. To give concrete numbers, take example of Na, trap has 
laser wavelength A ~985 nm while laser wavelength A ~ 589 nm (Na d-line). The depth of 
the optical trap is determined by 5E at the center, of order pK in this case. 
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2.6 Scattering theory 

The particle wave function after the scattering event will be of the form 



rj} = e ikz + f{6) 



ikr 



(37) 



For low energies we will have s-wave scattering only which means that f{6) will become a 
constant of order a, the s-wave scattering length. The wave function will become 



if> -> 1 



(38) 



where the last term is the asymptotic outgoing scattered wave ip sc . If we define x = then 
X ~ r — a and a will become the r-intercept of the radial function x- The general wave 
function has the form 

V> = X^m (39) 
Im 

where 

sWfcr- — Itt/9. -I- 

(40) 



iS sm(kr - lir/2 + 6 t ) 
rii ~ e 



and for / = we will have 



_ iSo sin(kr + 5 ) _ iSo sin kr cos 5 + sin ^ cos kr k ^o iSo 
X Te - re - re 

kr kr 



cos 5 + 



sin5 

kr 



re iS ° cos 5 



1 + 



tan 5o 
kr 



= re tS ° cos S 



1 - 



so the definition of a is 



a = 



tan <5 



(42) 



(43) 



and for weak scattering we have 8 ~ — ka. For the scattering energy we have 

h 2 P 
€k ~ 2/i ~~ m 

where /i — m/2 is the reduced mass. If we go back to Schrodinger's equation now we have 

(e° k - el)^ sc {k') = U(k'k) + U{k'k")^ sc {k") (44) 



and Fourier transforming it using 



k" 



^sc(k) = I d 3 re- lkr ^ sc (r) 



we will get the formal solution 



(el-el+i5)- l T{k'XE = e 



(45) 



(46) 



where the "T-matrix" T(k,k';E) satisfies (matrix eqn. in k-space!) 



T = U + U V- 

^ F, — p , , 



7? E — £k" + iS 



T 



(47) 



For weak scattering (k — > 0) we can use the Born approximation T(k, k') = U (0, 0) and get 

Ak'r 



J2e^ sc (k>) = _£ e ^^0) ^ _ mf7(0)0) ^ 



k' 2 



mC/(0,0) _ a 
r 



47rr 



in other words 



mU (0,0) 

47T 



(48) 
(49) 



For a more general calculation we can replace U by an effective potential. 
2.7 Effective potential and scattering length 

Problem The bare interaction between two atoms is well known at large distances and pos- 
sibly known at short distances. (Van der Waals) 

If we define the dimensionless strength parameter C% and using Bohr's radius a = 
h 2 /m e e 2 we can form the quantity a = C&e 2 OQ characterizing the Van der Waals energy 
strength. By dimensional analysis we can find the length scale of the interactions. Since the 
kinetic energy is ~ while the potential energy is ~ ^ we have a minimum when 



am 



1/2 



fC 6 m\ 
\ m e J 



1/4 



a > a . 



(50) 



So the length scale which is approximately r ~ 10 2 a . Below is a table of the C 6 values for 
various elements. 



Element 


c 6 


H 


6.5 


Li 


1393 


NA 


1556 


K 


3897 


Rb 


4691 


Cs 


6851 



A few remarks are in order. 

1) Clouds themselves are at least 10 times bigger. 
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2) The strength of the Van der Waals forces are ~ ^-^j. Hydrogen has smaller C 6 
because the ls-2p transition is lowest, not ns^np! 

Question: How can we calculate energy shifts due to interactions without knowing 
the details of the short range potential? 

Answer : Intoduce an "effective interaction" that describes the interactions between 
long wavelength degrees of freedom accounting for the short wavelength ones. Short distances 
are "integrated out" . 

Our goal is to show that we can work with an effective potential of the form 

~ . . 4irh 2 a . . . 

U(r) = 6(r) (51) 

m 

where a is the s-wave scattering length which is defined in terms of the asymptotic large- 
r part of the radial wavefunction x( r ) = r i>sc{r) but depends sensitively on the short- 
distance behavior of the bare potential. If scattering is not weak a similar calculation gives 
^ sc {r) = _V^M1 wh ich leads to 

a=^^. (52) 

The low-energy T-matrix is the "effective potential" in the sense that it gives the exact 
result for SE when inserted for U in the Born approximation result for ijj sc if the atoms are 
far apart. So if we are interested in the limit of interactions via long wavelength excitations 
we see that U (the effective potential in k-space) will be equal to 

U = = T(0,0). (53) 

m 

The energy shift SE ~ (\&|C/(r)|\I>) will then be (sum over all pairs of scatterers): 

^~^£l*fo->°)| 2 - ( 54 ) 

Zi ill 

To illustrate the dependence of a on short distances, we take a crude model of the Van der 
Waals potential 

K(r)=f~ T (55) 

[-fx r>r c 

which reproduces the correct large-distance behavior. The scattering equation for the radial 
wave function is 



X» + 



e _ l l±H _ 2m r U(r) 



X (r) = 0, (56) 



where m r is reduced mass, and for k = I = and r > r c we will have 



+ ^ x(r ) = 0. (57) 
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So x a t large distances will be of the from \ — c i r + °i an d if we define r = (2m r «) 1 / 4 
(check dimensions!) and x = r/r we end up with a relatively simple eqn. 

X"(x) + \ X (x) = 0. (58) 

The solution of the above differential equation with the boundary condition that x should 
go to zero at r c is 



x (x) = Ax" 2 



^ J 1/4 (^) J - 1/4l 2^ 



(59) 



where x c = r c /r . The asymptotic large-r behavior is extracted using the Bessel-function 
asymptotic formula 

JJz) ~ — ^ r (60) 

pv ; 2pr(i+p) v ; 

and the result is 

x ^ A vww) +B fmk (61) 

By definition we therefore get 

r(3/4) cos(r 2 /2r c 2 - vr/8) 

°2r(5/4) cos(r2/2r c 2 - 3vr/8) ' 1 ' 

The following remarks are in order: 

1) The scale is set by r 

2) a depends on the details of the short-range part of U (r) via r c 

3) a changes sign as r c is varied 

4) Divergence of a means a bound state. 

5) The sign of a for fixed r c depends on the energy of the highest bound state 

2.8 Gross— Pitaevskii equation 

The effective interaction at low energies is 

C/o = — (63) 
m 

a constant in momentum space so a delta function in coordinate space 

U( r -r') = U 5(r -r'). (64) 
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We will follow the Hartree procedure and assume that the states are symmetrized products 
of single-particle wave functions. The ground state is a condensate which means that all 
bosons are in the same 0o( r ) state. Notice that this takes care of the exchange symmetry 
the system should have. So the many body wave function will have the form 



N 



*(n,-- - ,r N ) = Y[Mn) 



(65) 



and the Hamiltonian 



N r 9 

n_ 

2m 



E 

i=i 



(66) 



i<j 



The ground state energy expectation value (Problem set) is 



E = (V\H\V) = N J d 3 



2m 1 



+ ^(r)|0 o | 2 + 



N 



-U, 



0<P0 



(67) 



The error we make in this approximation is that we assumed iV atoms in the condensate. 
In fact, interactions put some atoms in excited states, however it turns out that this is 
0(na 3 ) x / 2 = 0(a/r s ) 3 / 2 which for a <C r s is very small. The typical Alkali scattering length 
is 30 — 100 A. The densest gas obtained so far is n ~ 10 15 cm' 3 which gives 



r\ ~ 10~ 5 cm = 10 3 A. 



(68) 



So the corrections are small, 0(1-10%). 

Since we have iV atoms in the same quantum state it is tempting to define a single 
macroscopic wave function 

^(r) = iV o 1/2 0o(r). (69) 

Do not confuse ip(r) with the many-body wave function \l/(ri, • • • , rjv)- The normalization 

of i])(r) is 

<i 3 r|-?/>(r)| 2 = Nq = N (if T=0 in Hartree approximation) (70) 



so we have 

n(r) = \4){r)\ 2 (71) 

and ip(r) is called the order parameter. 

Formally this can be given precise meaning. From the one-particle density matrix 



p(r,r') = N J dr 2 - • ■ r N ^*(r,r 2 , ■ ■ ■ ,r N )^(r',r 2 , ■■■ ,r N ) 
= (&\r)^(r'))(m 2nd quantized form). 



But p is hermitian so 



p(r,r') = ^2niX*(r)xi(r') 



(72) 
(73) 
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so in BEC only one term of 0(1) (except in the fragmented BEC case) will survive in the 
sum and 

p(r,r') = Nxl(r) X o(r') = ^(r)^(r'). (74) 

To produce the "Gross-Pitaevskii equations" we have to find the opimal ip using the variation 
principle. This gives the result 

=0^ (75) 

dip 

h 2 

-— V 2 ^ + ^ + WlV = J#- (76) 
2m 

We see that for a uniform gas case = and V = and the solution becomes 

H = Wl 2 = nU . (77) 

For a trap the spatial extent of the cloud (= R) defines the scale of the kinetic energy 
h 2 /2mR 2 and the potential energy muj 2 R 2 /2. The typical size without the interactions is 
the harmonic oscillator length a osc = [hjmuj^) 1 ! 2 . So a repulsive U adds a +U ^ term 
and shifts the minimum to larger R. The kinetic energy term becomes less important as a 
result of that shift. For the strong-coupling case U, V >> kinetic energy. When can U, V 
be comparable? 

w> = ^ (78) 

R~ (^V* ~ (S*Y>* a J — V /5 »a„«. (79) 
V. muj 2 J \a osc J 

This approximation is called Thomas-Fermi approx. So most experiments are performed in 
the regime where the kinetic energy term can be neglected, which means the clouds must be 
sufficiently large. Let us solve the Gross-Pitaevskii equation neglecting the kinetic energy 
term 

r + (' () |<f v - /«.;. (80) 
The spatial dependence is driven by V and the solution of course is 

n(r) = H 2 = [n-V]/U Q . (81) 

The "boundary of the cloud" is given by V(r) = /i in this approximation. Obviously it fails 
in some surface layer of thickness S. Since the energy (V + Uo\ip\ 2 ) involves the mean density 
(|-0| 2 ) this is called the Thomas-Fermi approximation for the Gross-Pitaevksii equation. For 
the harmonic trap the size is given by 



Ri — \ l — ^-n- (82) 
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What about if the condensate is supressed locally for example by a hard wall? Then it 
"heals" over a "coherence length" £. So let us forget the external field and compare the 
kinetic energy term with U near the wall. We see that 

£p -<*>,*■ (83) 

2 fi 2 1 
^ 2mnU 5-nna ^ ^ 

For most systems we have a < ^ -C ii. 
2.8.1 Vortices 

The time-dependent Gross-Pitaevskii equation will allow for vortex solutions with quantized 
circulation. These vortex excitations in the condensate can be created in practice by a 
"laser paddle" , simply inserting a laser beam off-axis and "stirring" . The analogy with the 
Schrodinger equation is straight-forward and we can write 

"2^VV + Vj> + tfoM V = ih ^- (85) 

If we mutliply by ip* and subtract the complex-conjugate equation from it we end up with 
the current conservation equation 

^ + V • (nv) = 0. (86) 
If we define ip = fe % ^ and since n — |V>| 2 we find 

v = -V0. (87) 
m 

So v is the gradient of a phase which means that V x v = unless the phase is singular. 
Notice that V = in cylindrically symmetric geometry. 

Quantization of Circulation: The single valuedness of ip requires 

(f) v ■ dl = < — / d(b—(b > = — (where I is integer) (88) 
Jc [m J d(b ) m 

" v ' 

e.g., if C = circle (89) 

Notice that we can define a "quantum of circulation" which is ^ (by Feynman and Onsager). 
Notice as well that a simple aximuthal flow corresponds to 

V 2nr = — =>• (90) 
m 

hl 1 

v = - . 91 

2nm r 
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N.B. The kinetic energy diverges at r = so ip must vanish there. 
The total L = Nlh which corresponds to L/particle = Ih. In general this is not true 
for asymmetrical flows but the overall circulation is still quantized. 



The Gross-Pitaevskii equation in cylindrical coordinates for ip = fe^ is 



2m 



pdp\ dp y 



dz 2 



+ 



h 2 



Imp- 



:l 2 f + Vf + U f = pf. 



(92) 



We examine 1st the I = 1 vortex with no trap potential. If we would guess the length scale a 
priori it should be the coherence length £ since ip — > at the center. Let us scale all lengths 
with respect to £, and define x = p/£ and also scale / by the known solution /o = a/ p/Uq 
for the rotation-free system, and put X — f/fo (note this \ ls n °t the radial wave function!). 
The equation will transform to 



x dx \ dx 



x^ 



x = o. 



At large distances we have 
At short distances we have 



X 3 -X = 0^X=1. 



d ( dx , 

dx \ dx ' 



(93) 



(94) 



(95) 



The energy density will be (in general, again-put back V, arb. 



2m 



df 
dp 



+ 



df 

dz 



2-1 



h 



2^7 + Vf + Y ; ' 



(96) 



where / is now the solution to (95). If we set V 
vortex will be 

" h 2 fdf 



now, the energy/unit length of the I 



Jo 



2npdp 



U, 



M 



r 



(97) 



2m \dp J 2m p 2 ' 2 

where we have introduced the cutoff b to take care of the logarithmic divergence of the 
integrand (1/p 2 term). Note the vanishing of the solution / at p — > regularizes the integral 
there. We are interested in the extra energy the vortex introduces into the system so we have 
to subtract off the "background" . Notice that this is not quite done by simply averaging 
the I 2 1 ' p 2 term over all b, as you might think, since the asymptotic / is actually different 
(for fixed N) since the vortex pushes the condensed particles out from the center. A careful 
subtraction gives 

h 2 ( b\ 

Cvortex = ™— log ( 1.464- J . (98) 



Some remarks are in order. 
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1) The cutoff introduced for the divergence is the minimum of all the length scales of the 
system larger than £ (system size, inter-vortex spacing). 

2) If we redo the calculation for I ^ 1 we find 

h 2 fb\ 

tvortex ^ t 2 ™— l0g ( £ ) ' 

in other words it is always energetically favorable to put in many singly-quantized 
vortices. 

Vortex in trap 

The definition of £ will be — A* an d by the Thomas-Fermi theory /x = V(R) = ^muj 2 R 2 . 
So if we combine those two we will get 

A similar calculation of the energy gives 

h 2 ( R\ 
e v — log ( 0.888- J (101) 

as expected the size of the cloud cuts off the divergence. 

2.9 Remarks on conservation of angular momentum 

The Hess-Fairbank experiment (see Leggett's discussion) is the following. We start with a 
system of bosons rotating above T c and wait for equilibrium to be established, assuming the 
annular container has thickness d <C R. There is a critical angular velocity to stabilize the 
vortex with one quantum of circulation in this geometry. The energy of N particles of mass 
m (total mass M = Nm) rotating with angular velocity u should be equal to the energy of 
N particles each with angular momentum h: 

N-muj 2 R 2 = 



2 2MR 2 2mR 2 

= (102) 
mR z 

If the system rotates at uo uo c and we cool it down below T c , the superfluid stops rotating 
in the lab frame because the angular momentum of it must be quantized and the closest 
quatum state is the / = 0. 

Question: What happens to the "lost" angular momentum? In the 4 He experiment 
it is transferred to the bucket, which actually speeds up! In BEC with no fixed container, 
the angular momentum can be taken up by elementary excitations of the condensate cloud 
of thermal nature (T > 0), or by the center of mass motion. In the Hess-Fairbank case, 
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since uj <C oo c , the center of mass motion can not be excited since it carries L ~ NH. Surface 
waves though, can carry away any number of h quanta. In general, the state of the system is 
the combination of the above that minimizes the energy i.e. it can be calculated. (Analogy: 
the ground state in nuclear physics.) 
Other alternatives: 

a) Evaporation. 

b) Circular polarization of optical trap photons. 

In general the combination of these excitations which minimizes the ground state 
energy at fixed L is complicated (see "yrast" states in nuclear physics). 

3 Interference of two condensates 

Andrews et al, Science 275 , 637 (1997) 
Precis of experiment: 

1) make trapped condensates by usual methods 

2) separate with laser beam into two 

3) release at t — and observe the overlapping condensates at time t with phase-contrast 
density measurements 

If the two clouds may be thought of as two phase-coherent sources separated by d, 
we can expect interference fringes 



where ipi^ are the single particle wave functions which we can assume initially to be Gaussians 
of size Rq. Then they spread as 




(103) 




(104) 



and 




(105) 



Density profile We have 



n 



(r,t) = M 2 = iVi|^i| 2 + N 2 \ip 2 \ 2 + 2 v / AyV 2 Re[y9 1 y?;]. 



(106) 



The interference term will be proportional to 




(107) 
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the first fringe is when —-^^t = 2ir which means 

m H K t 

mR^Rf ht .„ , „ 

Ar = 2 *rf = ;ss <*» large '> < 108) 

which is observed by the experiment! But if clouds are well separated (tunneling time 
estimated to be > age of universe!), we should consider states with definite particle number 
Ni, N 2 (more a priori realistic) we get a Fock state of the form 

= 7mm (a ' )N ' (a> ^ 0) (109) 

where a\ creates a particle in (pi(r). Again we can calculate the density by using the field 
operator 

a\ = / d 3 ripi(r)^(r) (110) 



and 

^(r)\N u N 2 ) = v^ViWIiVi - l,N 2 ) + y/N2(p 2 (r)\N u N 2 - 1) (111) 

and we will get 

n(r) = (N u N 2 \^(r)^(r)\N u N 2 ) 

= iVi|y?i| 2 + N 2 \if 2 \ 2 (no interference), (112) 

which means the sources are "totally incoherent". What is going on? n(r) is the expectation 
value of the density operator i.e. the average over the result of many measurements. If we 
measure only once, as in the experiment we pick out one of the possible eigenvalues of the 
operator ^(r)^(r) with some probability. 

In condensed matter we usually assume it is OK to replace averages over repetitions 
of the experiment by averages over all atoms. For example, if we measure the density of 
a crystal 20 times we will get the same result, with small 0(1 /N) quatum fluctuations, 
unobservable. This assumption can fail if the system is highly correlated, for example in a 
Bose condensate! 

To detect interference in an experiment which averages over many runs, we should 
look at the operator which measures the fluctuations, namely 

(N u JV 2 |n(r)n(r')|JVi, N 2 ) = (A^i| 2 + 7V 2 |^ 2 | 2 )( Wi| 2 + N 2 \ip' 2 \ 2 ) - A^l >'i| 2 

- iV 2 |y9 2 | V 2 | 2 + 2N 1 N 2 R4<p* 1 '<p 1 <p* 2 <pf 2 ]. (113) 

V v ' 

interference 

Alternatively, in a "one-shot" measurement like Andrews et al. (only one cloud is split and 
the density profile measured after release), one should measure an interference pattern in a 
density measurement. Our claim is the following, if the experiments were repeated many 
times, we would get fringes each time, but with arbitrary relative phase, so the expectation 
value would then cancel such fringes. 
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1 Josephson Effect 



In 1962 Brian Josephson 1 , then a 22-year old graduate student, made a remarkable predic- 
tion that two superconductors separated by a thin insulating barrier should give rise to a 
spontaneous (zero voltage) DC current, I s = 7 c sinA0, where A<p is the difference in phase 
across the junction. And that if a finite (DC) voltage were applied, an AC current with 
frequency u = 2eV/H would flow. I c is called the Josephson critical current. 

There is a myth that Brian Josephson did his calculation (1962) and won the Nobel 
prize (1973) as part of the solution to a homework problem of Phil Anderson's. The truth is 
that Anderson was a lecturer on sabbatical at Cambrige in 1961-62, and he gave a series of 
lectures in which he mentioned the problem of tunneling between two superconductors, which 
Josephson then promptly solved. The idea was opposed at first by John Bardeen, who felt 
that pairing could not exist in the barrier region 2 . Thus much of the early debate centered on 
the nature of the tunneling process, whereas in fact today we know that the Josephson effect 
occurs in a variety of situations whenever two superconductors are separated by a "weak 
link", which can be an insulating region, normal metal, or short, narrow constriction. 3 

Let's first consider the last example as the conceptually simplest. The Ginzburg- 
Landau equation appropriate for this situation may be written 



where £ = y 2 m*a(r) * s tne coherence length and f(x) = ^(x)/^^. Take L <C £, so the 
deviations of \1/ coming from the bulk value ^ of the first SC is small, and vice versa for 
the second SC. Changes of ^ in the constriction occur over a length scale of L, so that the 
first term is of 0((£/L) 2 ) 3> / — f 3 . So we must solve a Laplace equation for /, = 0) 

iphys. Lett. 1, 251 (1962) 
2 Physics Today, July 2001 

3 In Je. 2009 I received an email from Brian Josephson correcting this version of the history: 

Date: Wed, 10 Jun 2009 09:43:54 +0100 
From: Brian Josephson <bdj 10@cam. ac .uk> 
To: pjh@phys.ufl.edu 
Subject: the Josephson myth 
Dear Peter, 

While browsing I came across your mention of the 'myth' that I discovered the effect 
because of a problem set by Anderson. Your correction is not completely correct 
either! It was Pippard, my supervisor, who drew my attention to Giaevar's tunnelling 
expts. and his theory, which started me thinking (especially as to how one could 
get away without using coherence factors) . Anderson on the other hand told me of the 
Cohen/Falicov/Phillps calculation involving a single superconductor when it came our 
in PRL, which gave me the idea of how to do the two-sc. case. Previously I had got 
the broken symmetry idea which was crucial from a number of papers including Anderson's 
pseudospin model, and also expounded in his lecture course which I went to. 

Best regards, Brian J. 
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with B.C. /(O) = 1, f(L) = e* A *. The solution will be 



/ 



1 



L 



JA</> 



L 



(2) 



The solution can be thought of as two terms, the first beginning to "leak" into the 
constriction from the left, the second \&2 leaking into the constriction from the right. The 
GL expression for the current will be 

e*h e* 2 

- ^W*) ^*^A 



2m*i 
2m*i 00 



e*M> 2 

sin A0, 



nrc 



+ -e- iA <t> ) ( - - + -e^ ) - 



c.c 



m*L 

which means that the current will be 



I — I c sin A0, 
m*L 



(3) 

(4) 
(5) 



where A is the cross-section. 

Given that we have two weakly coupled QM systems, it is "not unreasonable" (justi- 
fied by microscopic theory) to write down coupled Schrodinger equations 



3*1 r, t 

at 
M 2 



dt 



= E 2 ^2 + O&i 



(6) 
(7) 



where H^^i = E^i and E\ = E 2 = E if the superconductors are identical. Take \^i\ 2 to 
be the density of pairs in SCj 



1 



2 x fn 



ni 



n 2 
2JE~ 2 



+ iy/r^h = -iE 2 ^T 2 - ia^n~ie i{<tn - M . 



(8) 
(9) 
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Figure 1: 



If we take the real parts and use hi = — n 2 we get 

— = a v /n^sin(0 2 - <t>i) 

= a v /nTsin(0 1 - 2 ) =>• 
hi = 2a- v /nin 2 sin(0 2 - 0i), (10) 

Note I've put V=A=1. Then the current is just j = 2ehi. If we take the imaginary parts 
we have 



^i0i = — Ei \fn~i — ot\fri2 cos(0 2 — <f>i) (11) 
= —E 2 ^/h~2 - a v /nlcos(0i - 2 ), (12) 



and by subtracting and assuming rii ~ n 2 (let's couple 2 identical superconductors at first) 
we get 

0i - 02 = #2 - = 2e(Vi - V 2 ) (13) 

where for the second equality we used the fact that the potential difference between the 
superconductors shifts the pair energies by — 2eV. So we see that a finite voltage difference 
leads to a time changing phase difference A0 which means an AC current via Eq. (10). 

Magnetic fields. Now put a flux through the junction where the B field is along the 
— y direction and A = —Bxz. The phase of the wave function \1/ must change by 

0^0- — [dS-A (14) 



c 

for the theory to be gauge invariant. Notice that is now space-dependent. So the Josephson 
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Figure 2: 



equations will read 



j = 4eay/ni7i2 sin I A0 



2e 



dS ■ A , 



Jc 



2e(Fi - V2) 



at 



2e 



dS • A , 



and since 



we will have 



J S • A = J dz(-Bx) = -Bxd, 



(15) 
(16) 

(17) 



J 



/ day (a;) 
Jo 



2e 

dxjcSin ( A$ 5xd 



2tt$/$ 



cos A$ - cos A0 + 



2tt$ 







(18) 



where <3>o — ^p- What is the maximum current through the junction for all possible A0? 
We have to calculate = which leads to the relation 



tan A$ = cot (7r$/$ ) 
and with a bit of tedious trigonometry to 

sin (7T$/$ ) 



(19) 



(20) 



7T$/$ 

This formula produces the Josephson-Fraunhofer interference pattern. DC SQUID (Super 
conducting Quantum Interference Device) 
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We ignore resistance and capacitance for now. The inside SC thickness is assumed much 
greater than A and since v s = we have V0 = 2A/$ . The flux will be 



<[> = d> ds ■ A = / ds ■ A + — / ds ■ V0 + / ds-A+—[ ds • V0 



2 

= y(03-0 2 ) + y(0i-0 4 )+ / ds-A+l ds-A 

= Y(01-02)+/ C?S • A + y(0 3 - 4 ) + / rfs-A 

v — v i ' v ' 

=—712 =—734 

$ = 743 - 712- (21) 

The Josephson current through the SQUID will be 

J = J c (sin7 12 + sin 743) 
= J c (sin 7l2 + sin(7 12 + $)). (22) 

Which means that the current oscillates with the flux. And as a result of that the SQUID 
can be a sensitive measure of magnetic fields. In practice we include the capacitance and 
resistance of the device. 
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